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PEPTIDE SYNTHETASE GENE CPSl 

CROSS REFERENCE TO RELATED APPLICATIONS 

This application is a continuation in part application of U.S. 
Application Serial No. 09/448,215, filed November 23, 1999. 

5 STATEMENT REGARDING FEDERALLY SPONSORED RESEARCH 

The present invention was made with support from the United 
States Government under Grant No. 96-35303-3198 from the USDA/NRI. The 
United States Government may have certain rights in the invention. 

FIELD OF THE INVENTION 

1 0 The present invention relates to nucleic acid molecules encoding 

peptide synthetase homologs of Cochliobolus heterostrophuSy Pyrenophora teres, 
Fusarium graminearium, and AUernaria solani and uses thereof. 

BACKGROUND OF THE INVENTION 

There are approximately 30 species included in the genus 

1 5 Cochliobolus, nearly all of which are pathogens of wild grasses or cereals (Voder 
et al, ''Cochliobolus spp. And Their Host-Specific Toxins, in Carroll, eds., The 
Mvcota Vol. 5: Plant Relationships. Part A , Berlin: Springer- Verlag, pp. 145-166 
(1997)). Cochliobolus heterostrophus represents the most widely distributed 
species in the genus and can be found in many tropical and subtropical areas in the 

20 world. As a natural pathogen of com, C. heterostrophus causes a disease 

frequently called leaf spot of maize in the old literature (Drechsler, "Leafspot of 
Maize Caused by Ophiobolus Heterostrophus n. sp., The Ascigerous Stage of a 
Helminthosporium Exhibiting Bipolar Germination," J. Agr. Res. . 3 1:701-726 
(1925); Drechsler, "Phytopathological and Taxonomic Aspects of Ophiobolus, 

25 Pyrenophora, Helminthosporium, and a New Genus, Cochliobolus^ PhvtopathoL, 
24:953-983 (1934); Yu, "Studies on Helminthosporium Leaf Spot of Maize," 
3:273-3 18 (1933); Orillo, "Leafspot of Maize Caused by Helminthosporium 
maydis:' 36:327-395 (1952). In the United States, C heterostrophus is usually 
found in the warmer southern states, thus, the disease is commonly known as 

30 Southern Com Leaf Blight (Hooker, "Cytoplasmic Susceptibility in Plant 
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Disease," Ann. Rev. PhvtODathoL , 12:167-179 (1974)). For many years, Southern 
Com Leaf Blight was only known as an endemic disease and was not considered 
to be major economic importance in the United States. But in 1970, it suddenly 
broke into a severe epidemic that destroyed 15% of the U.S. com crop and caused 
5 losses estimated at more than $1 billion. This serious damage made Southern 
Com Leaf Blight one of the most v^dely known crop diseases in the U.S. 

Prior to the outbreak of the disease, only one race of C 
heterostrophus (race 0) was known in the field. In late 1969 when the disease 
became an epidemic, a new race of the fungus was identified from infected com 

1 0 leaves collected in severely diseased areas. It was soon designated as race T 
because of its high vimlence on T-cytoplasm com and the ability to produce a 
phytotoxin called T-toxin, which specifically affects T-com. In contrast, race O 
does not produce T-toxin and is mildly virulent on both T-cytoplasm and N- 
cytoplasm (normal cytoplasm) com (Hooker et al., "Physiological Races of 

1 5 Helminthosporium maydis and Disease Resistance," Plant Pis. Reptr., 54:11 09- 
1110 (1970); Scheifele, "Cytoplasmically Inherited Susceptibility to Diseases as 
Related to Cytoplasmically Controlled Pollen Sterility in Maize," 25:1 10-138 
(1970); Smith et al, "Physiologic Races oi Helminthosporium maydis," 54:819- 
822 (1970); Yoder et al., "Segregation of Pathogenicity Types and Host-Specific 

20 Toxin Production in Progenies of Crosses Between Races T and O of 

Helminthosporium maydis {Cochliobolus heterostrophus)"" Phytopathology, 
65:273-276 (1975); Yoder, "Evaluation of the Role oi Helminthosporium maydis. 
Race T Toxin in Southem Com Leaf Blight, in Tomiyama, eds., Biochemistry and 
Cytology of Plant Parasite Interaction , New York, New York:Elsevier, pp. 16-24 

25 (1976); Yoder, "Toxins in Pathogenesis," Ann. Rev. Ph vtopathol., 18:103-129 
(1980)). T-cytoplasm stands for Texas male sterile cytoplasm, a unique 
cytoplasm with a trait for maternally inherited male sterility, characterized by the 
failure to produce pollen (Levings, "The Texas Cytoplasm of Maize: Cytoplasmic 
Male Sterility and Disease Susceptibility," Science , 250:942-947 (1990)). T- 

30 cytoplasm com was widely used for hybrid seed production and breeding to avoid 
hand or mechanical emasculation in the 1950s and the 1960s. It was the 
coexistence of large acreages of intensively planted T-cytoplasm com and the 
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sudden appearance of race T of C heterostrophus that resulted in the epidemic of 
the disease in 1970. This discovery first opened the door to understanding 
pathogenesis by C. heterostrophus. 

Early genetic analysis suggested that both T-toxin production and 
5 high virulence on T-cytoplasm com are controlled by a single geneUc locus 

defined as Toxl (Leach et al., "Dominance at the Toxl Locus Controlling T-Toxin 
Production by CocMiobolus heterostrophus" Phvsiol. Plant Pathol., 21:327-333 
(1982)). This was demonstrated by crosses between race T and race O in which 
only parental phenotypes segregated in a l:i ratio (Tox+:Tox-); all T-toxin 
1 0 producing progeny are highly virulent on T-cytoplasm corn while all T-toxin 
nonproducing progeny are weakly virulent (Yoder et al., "Segregation of 
Pathogenicity Types and Host-Specific Toxin Production in Progenies of Crosses 
Between Races T and O of Helminthosporium maydis {CocMiobolus 
heterostrophus). Phytopathology . 65:273-275 (1975); Leach etal., "Dominance at 
1 5 the Toxl Locus Controlling T-Toxin Production by CocMiobolus heterostrophus," 
Phvsiol. Plant Pathol. . 21:327-333 (1982)). Further investigation by comparison 
of electrophoretic karyotypes and chromosome-specific DNA hybridizations 
indicated that Toxl is tightly linked to a reciprocal translocation breakpoint and is 
associated with as much as a megabase of DNA (mostly highly repeated and A+T- 
20 rich) that is missing in race O (Bronson, "Ascospore Abortion in Crosses of 
CocMiobolus heterostrophus Heterozygous for the Virulence Locus Toxl," 
Genome . 30:12-18 (1988); Tzeng et al., "A Restriction Fragment Length 
Polymorphism Map and Electrophoretic Karyotype of the Fungal Maize Pathogen 
CocMiobolus heterostrophus," Genetics . 130(l):81-96 (1992); Chang et al.. "A 
25 Reciprocal Translocation and Possible Insertion(s) Tightly Associated with Host- 
Specific Virulence in CocMiobolus heterostrophus," Genome, 39(3):549-557 
(1996)). Surprisingly, recent analyses of several Tox' mutants revealed that Toxl 
is not a single locus but rather two loci, each on a different translocated 
chromosome (Yoder et al., "Molecular Determinants of the Plant/Fungus 
30 Tnt"^^''*i»" " K-^hmntn, ftH<! Hnst-Specific Tnvin: Riosvnthe.sis. Receptor and 
Molecular Biology. Tottori, Japan:Faculty of Agriculture, Tottori Univ., pp. 23-32 
(1994); Turgeon et al., "Function and Chromosomal Location of the CocMiobolus 
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heterosirophus Toxl Locus," Can. J. Bot., 73 (suppl. 1 Sect. E-H):S1071-S1076 
(1995)). These two Toxl loci have been designated ToxJA and ToxJB (Yoder et 
al., ''Cochliobolus spp, and Their Host-Specific Toxins " in Carroll, eds., Jhe 
MvcQta Vol. 5: Plant Relationships. Part A , Berlin: Springer- Verlag, pp. 145-166 
5 (1997)). Two genes PKSl and DECl have been cloned from the two loci 

respectively; both are required for biosynthesis of T-toxin and are found only in 
race T isolates of C heterostrophus (Yang, "The Molecular Genetics of T-Toxin 
Biosynthesis by Cochliobolus heterostrophus ^ Ph.D. Thesis, Cornell University 
(1995); Yang et al., "A Folyketide Synthase is Required for Fungal Virulence and 
10 Production of the Polyketide T-Toxin," Plant Cell 8(1 1):2139-2150 (1996); Rose 
et al., "A Decarboxylase Required for Poloyketide Toxin Production and High 
Virulence by Cochliobolus heterostrophus;' 8'*^ Int. Svmp. Mo!. Plant-Microbe 
Int, Knoxville, p. J-49 (1996)). 

Genetic analysis also suggested that T-toxin is required by C 

1 5 heterostrophus for its high virulence on T-cytoplasm com. This hypothesis was 
first tested by the generation of induced T-toxin deficient mutants using different 
mutagenesis procedures. All mutants with a tight Tox" phenotype cause disease 
symptoms that are indistinguishable from those caused by race O when tested on 
both T and N-cytoplasm com, suggesting that T-toxin is indeed a vimlence factor 

20 (Yang et al., 1992; Lu et al., "Tagged Mutations at the Toxl Locus of 

Cochliobolus heterostrophus Using Restriction Enzyme-Mediated Integration," 
Proc. Natl. Acad. Sci. USA . 91:12649-12653 (1994); Rose et al., "A 
Decarboxylase Required for Polyketide Toxin Production and High Vimlence by 
Cochliobolus heterostrophus;' 8^^ Int. Svmp. Mol. Plant-Microbe Int., Knoxville, 

25 p. J-49 (1996)). This conclusion was firmly supported by the site-specific 

dismption of the PKSl or DECl in the wild type race T genome; dismptants lost 
the ability to produce T-toxin and caused race 0 type symptoms on both T-com 
and N-com (Yang et al., "A Polyketide Synthase is Required for Fungal Vimlence 
and Production of the Polyketide T-Toxin," Plant Cell 8(1 1):2139-2150 (1996); 

30 Rose et al., "A Decarboxylase Required for Polyketide Toxin Production and 
High Vimlence by Cochliobolus heterostrophus;' Int. Svmp . Mol. Plant- 
Microbe Int. . Knoxville, p. J-49 (1996)). These experiments have given a very 
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clear resolution for the role of T-toxin in pathogenesis. They also implied that 
pathogenesis by C. heterostrophus must involve additional pathogenicity factors 
because race O which does not produce T-toxin and race T-derived Tox- mutants 
are effective pathogens on com. 

5 A number of fungal molecules have been identified as general 

pathogenicity or virulence factors in several plant pathogenic fiingi (Yoder et al., 
"Molecular-Genetic Evaluation of Fungal Molecules for Roles in Pathogenesis in 
Plants," J. Genet.. 75(3):425-440 (1996)). These include potential penetration 
factors such as melanin (Guillen et al., "Linkage Among Melanin Biosynthetic 

1 0 Mutations in Cochliobolus heterostrophus," Funpal Genet. Newsl.. 4 1 :4 1 -42 
(1994)), cutinase (Oeser et al., "Pathogenesis by Cochliobolus heterostrophus 
Transformants Expressing a Gene Encoding Cutinase from Nectria 
haematococca" Mnl Plant-Microbe Int. . 7:282-288 (1994)) and 
polygalacturonase and xylanase (Lyngholm et al., "Mutants of Cochliobolus 

1 5 heterostrophus Deficient in Extracellular Enzymes," Funpal Genet. Newsl., 42:46- 
47 (1995)) or possible mechanisms involved in colonization such as phytotoxin 
detoxification (Schafer et al., "One Enzyme Makes a Fungal Pathogen, But Not a 
Saprophyte, Virulent on a New Host Plant," Science . 246:247-249 (1989)) or 
components of signal transduction pathways (Horwitz et al., "A G Protein Alpha 

20 Subunit Gene From the Com Pathogen Cochliobolus heterostrophus is Involved 
in Two Complex Developmental Pathways: Mating and Appressorium Formation 
(unpublished) (1997)). Although C. heterostrophus is known to produce a 
nonhost specific toxin called ophiobolin (or cochliobolin), a Cjs sesterterpenoid 
compound, which is toxic to many organisms, including plants, bacteria, fungi and 

25 nematodes, there is no evidence that ophiobolins are involved in pathogenesis by 
C. heterostrophus or other phytopathogenic fungi. No other pathogenesis-related 
toxins have been isolated from C. heterostrophus so far, but studies on closely 
related Cochliobolus species and other phytopathogenic fungi suggest that 
pathogenesis by this group of fungi also involves peptide toxins. 

30 Four peptide phytotoxins (victorin, HC-toxin, AM-toxin, and 

enniatins) have been characterized as pathogenicity or vimlence factors. They are 
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all small cyclic peptides (4-6 residues), containing unusual amino acids or 
hydroxy acids, and they can be either host specific or non-host specific in terms of 
plant toxicity. A number of peptide phytotoxins are believed to be synthesized 
nonribosomally. Early in the 1960s, several biochemists working on the bacterial 
5 peptide antibiotics gramicidin and tyrocidine found that these polypeptides can be 
synthesized in RNAase-treated particle-free extracts of Bacillus brevis that are 
known to produce the same antibiotics; adding protein-synthesis inhibitors to the 
extracts does not affect this process. This indicated the existence of a peptide 
biosynthetic system in which ribosomes and mRNAs are not needed. Further 

10 studies revealed that in this system, peptides are synthesized on a protein-template 
and this template itself is a multifunctional enzyme or a complex of several such 
enzymes, collectively called peptide synthetases, catalyzing the biosynthetic 
process (Laland et al., "The Protein Thiotemplate Mechanism of Synthesis for the 
Peptide Antibiotics Produced by Bacillus Brevis,"" Essays in Biochemistry , 7:31- 

15 57(1 973); Lipmann, "Bacterial Production of Antibiotic Polypeptides by Thiol- 
Linked Synthesis on Protein Templates," Adv. Microbiol. Physiol., 21 :277-266 
(1980)), 

Peptide synthetases can catalyze biosynthesis of a variety of 
peptides. In terms of bioactivity, they can be antibiotics, enzyme inhibitors, plant 

20 or animal toxins and immunosuppressants (Stachelhaus et al., "Modular Structure 
of Peptide Synthetases Revealed by Dissection of the Multifunctional Enzyme 
GrsA.," Journal of Biological Chemistry, 270(1 1):6163-6169 (1995)). In terms of 
chemical structure, they can be either linear (i.e. ACV, the penicillin precursor and 
gramicidin) or cyclic (most are). The latter can be further classified into three 

25 subgroups: 1) The "standard" cyclic peptides (i.e. gramicidin S, tyrocidine, HC- 
toxin and cyclosporin); 2) cyclic lactones (i.e. destruxin); 3) cyclic depsipeptides 
(i.e. beauvericin and enniatin). There have been over 300 different carboxy 
compounds that can be activated by peptide synthetases. 

Although the first peptide synthetase. Gramicidin S synthetase, was 
30 purified and used for the cell-free synthesis of the peptide early in the 1960s 
(Tomino et al., "Cell-Free Synthesis of Gramicidin S," Biochem. , 6:2552-2560 
(1967)), the first bacterial peptide synthetase gene, tycA, which encodes the 
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tyrocidine synthetase 1 in B. brevis was not cloned until almost twenty years later 
(Marahiel et al., "Cloning of the Tyrocidine Synthetase 1 Gene from Bacillus- 
brevis and Its Expression in Escherichia-coli, Mol. Gen. Genet., 201(2): 1986 
(1985)). Since then, more than twenty peptide synthetase genes have been 
5 reported for both bacteria and filamentous fiingi, but only fourteen have complete 
nucleotide sequences published. All are larger than 3.3 kb and range between 3.3- 
19.5 kb for bacterial genes and 9.4-45.8 kb for fungal ones. Interestingly, all 
fiingal peptide synthetase genes reported lack introns, even the cyclosporin A 
synthetase gene simA, which has a 45.8 kb of open reading frame (the largest 
1 0 genomic ORF so far recorded). Although biosynthesis of bacterial peptides 
differs from that of fungal ones in terms of the number of multifunctional 
enzymes involved, the genes encoding these enzymes are similar to each other in 
both function and structure. Comparison of nucleotide sequences reveals one or 
more highly conserved regions at certain positions in each peptide synthetase 
1 5 gene. These regions formerly called "amino acid activating domains" 

(Stachelhaus et al., "Modular Structure of Peptide Synthetases Revealed by 
Dissection of the Multifunctional Enzyme GrsA," .Toumal of Biological 
Chemistrv . 270(1 1):6163-6169 (1995)), now called "amino acid activating 
modules" (Marahiel, "Protein Templates for the Biosynthesis of Peptide 
20 Antibiotics." Chem. Biol. . 4(8):56 1 -567 (1 997)) consist of a set of domains 
(formerly called "modules") believed to have specific functions such as 
recognization, activation and thioesterification of individual constituent amino or 
hydroxy acids, and in some cases methylation and racemation for modification of 
certain residues before incorporation into the peptide chain (Stachelhaus et al., 
25 "Modular Structure of Peptide Synthetases Revealed by Dissection of the 

Multifunctional Enzyme GrsA," Journal of Bio logical Chemistry. 270(1 1):6163- 
6169 (1995)). The most convincing evidence supporting this assignment is that ir 
most cases, the number of conserved functional units in each gene or gene cluster 
is equal to the number of amino acids in the respective peptide. This one-for-one 
30 match is very clear between three of four fungal peptides and their biosynthetic 
genes. The total number of modules in three of four bacterial gene clusters also 
matches the number of amino acids in the respective peptides. 
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Sequence alignment of amino acid-activating modules reveals 
strictly conserved sequence motifs that contain active residues for module 
functions. These motifs are called "core sequences" (Marahiel, "Multidomain 
Enzymes Involved in Peptide Synthesis/* FEBSLett, 307(l):40-43 (1992)). A 
5 minimal amino acid-activating module must contain six core sequences, whose 
functions (except for core 1) have been proposed based on mutational analysis of 
several peptide synthetases. Core sequences 1-5 are grouped into an amino acid 
adenylation domain and core 6 is a thioester formation domain (Figure 1 A). All 
bacterial peptide synthetase genes contain "type I modules" - the minimal amino 

1 0 acid activating modules which were previously called "type I domains" 

(Stachelhaus et al "Modular Structure of Peptide Synthetases Revealed by 
Dissection of the Multifunctional Enzyme GrsA," Journal of Biological 
Chemistry , 270(1 1):6163-6169 (1995)). Two fungal genes, acvA and HTSl also 
have this modular structure. In addition to the type I module, two fungal genes, 

1 5 esynJ and simA, contain type II modules, in which an insertion (about 400 amino 
acids) is found between cores 5 and 6 of a normal type I module. This region 
contains a motif (VLE/DXGXGXG (SEQ. ID. No. 1)), highly conserved in S- 
adenosyl-methionine (SAM)-dependent methyltransferases, hence, it is refened to 
as a A^-methylation domain (Figure lA). Additional evidence for 

20 methyltransferase activity of this module is that the number and position of type II 
modules in esynl and simA exactly match that of iV-methylated amino acids in 
ennatin and cyclosporin sequences (Figure IB). 

Although the modular structure described above is highly 
conserved among most peptide synthetase genes, some variations have been found 

25 in the latest cloned peptide synthetase gene sq/B, which is the first gene in the 
saframycin Mxl synthetase gene cluster (Pospiech et al., "A New Myxococcus 
xanthus Gene Cluster for the Biosynthesis of the Antibiotic Saframycin Mxl 
Encoding a Peptide Svnthetase," Microbiology. 141(8): 1793- 1803 (1995)). safB 
contains two type 1 amino acid activating modules. One module has all six highly 

30 conserved core sequences, but another, believed to activate alanine (the first 

amino acid in the linear tetrapeptide precursor of saframycin Mxl), lacks core 5 
and has a weakly conserved core 1 (Pospiech et al., "Two Multifunctional Peptide 
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Synthetases and an O-methyllransferase Are Involved in the Biosynthesis of the 
DNA-Binding Antibiotic and Antitumour Agent Saframycin Mxl from 
MyxococcusxarithusrMxcromQgl. 1 42(4) :74 1-746 (1996)) (Figure lA). This 
suggests that some of the motifs in the amino acid adenylation domain are 
5 dispensable or not critical for domain function. It also raises the possibility that 
other variations might be found in yet unknown peptide synthetase genes. 

Although C helerostrophus has been a model eukaryotic plant 
pathogen since the 1970s, most molecular genetic analyses conducted in this . 
system have focused on production of the polyketide T-toxin by race T isolates of 

10 the fungus. Solid evidence now indicates that T-toxin is a host-specific virulence 
factor in Southern Com Leaf Blight (Yoder et al., "Molecular-Genetic Evaluation 
of Fungal Molecules for Roles in Pathogenesis in Plants " J. Genet., 75(3):425- 
440 (1996); Yoder et al., Cochliobolus spp, and Their Host-Specific Toxins, 
Carroll eds., The Mvcota Vol. 5: Plant Relationships. Part A, Berlin: Springer- 

15 Verlag, pp. 145-166 (1997)). It is clear, however, that C. heterostrophus needs 
additional factors, presumably general factors for pathogenesis to corn plants, 
since race O, which does not produce T-toxin, can be an effective com pathogen. 
Attempts to identify additional general factors required by C. heterostrophus for 
pathogenesis have been unsuccessful. Cloning and characterizing additional C. 

20 heterostrophus genes that control biosynthesis of novel fungal molecules involved 
in critical pathogenic processes may be important because it offers potential 
targets for the design of products that might interfere with the com plant infection 
process. The present invention is directed to achieving these objectives. 

SUMMARY OF THE INVENTION 

25 The present invention relates to an isolated nucleic acid molecule 

encoding CPSl peptide synthetase homologs. The DNA molecule comprises a 
nucleotide sequence which hybridizes to a DNA molecule having a sequence as 
set forth in at least one of SEQ. ID. No. 2, SEQ ID No. 41, SEQ ID No. 43, or 
SEQIDNo.45. 

30 Another aspect of the present invention relates to a method for 

identifying inhibitors of a CPSl protein or polypeptide which involves providing 
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the CPSl protein or polypeptide, contacting the protein with potential inhibitor 
compounds, determining peptide synthetase activity, and selecting compounds 
which decrease the peptide synthetase activity. 

Still another aspect of the present invention relates to a method of 
5 imparting disease resistance to a plant by overexpressing a CPS 1 protein or 
polypeptide in the plant cell. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 provides the structure of amino-acid activating modules 
identified in peptide synthetase genes (adapted from Stachelhaus and Marahiel, 

10 1995; Pospiech 1995; Marahiel, 1997). Figure lA shows the domain 

arrangements in two types of modules. Structural variations in the first module 
(safBl) of the gene safB sat also indicated below type I. Figure IB shows the 
correlation between module types and the nature of residues in two fungal 
peptides. Open box: type I module; filled box: type II module. Each peptide 

1 5 sequence is given below. 

Figure 2 depicts the recovery of DNA flanking the REMI vector 
Insertion site (arrows) in mutant R.C4.2696. Circled numbers indicate restriction 
enzyme sites used for recovery of each plasmid. p214B7 contains 4.2 kb flanking 
DNA (3.4 left; 0.7 right); p214Ml contains 0.1 kb left flank that overlaps with 
20 p2 14B7 and 1 . 1 kb right flank that overiaps with p2 1 4S 1 , which contains 3.2 kb 
flanking DNA on the left only. 

Figures 3A-B illustrate the extension of the tagged sequence by 
targeted integration and plasmid rescue. Figure 3 A shows a general scheme 
illustrating chromosome walking strategy (only one direction is indicated). "X" 

25 indicates a restriction enzyme site used for recovery of vector (indicated by "T") 
with flanking genomic DNA ("L" for left flank; "R" for right flank; R' for a 
fragment from "R" that is subcloned into a subsequent transformation vector). 
Each integration site is indicated by a vertical arrowhead pointed at the selectable 
marker on the vector. Overlapped flanking DNA is represented by bars drawn in 

30 the same pattern. Figure 3B shows two strategies used for construction of a 
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targeted integration vector. Left: A genomic DNA fragment (indicated by the 
letter "a") is subcloned into a vector directly and the subsequent integration occurs 
by a single crossover ( "a" is duplicated). Right: "a" is subcloned into a vector via 
multiple steps (see Figure 6 for an example) that allows linearization with 
5 restriction enzyme " Y" and the subsequent integration occurs by a double 
crossover resulting in a simple insertion. 

Figure 4 shows how the targeted integration vector p214SNP was 
constructed. The sites for restriction enzyme digestion are indicated by arrows. 
p214SNP was originally designed for a double crossover integration after digested 
1 0 . with BstXl and Bell, but it was found later that the E, coli strain (DH5a) used for 
amplification of the plasmid is Dam"" which blocks the BcH site (TGA TCA to 
TG'^TCA). As an alternative, the plasmid was linearized with BstXl only and 
used for transformation. 

Figure 5 depicts the recovery of genomic DNA flanking the 
1 5 targeted integration vector p214SNP in transformant #118. The vector integrated 
into the target site by a single crossover that resulted in duplicated HindlW-Sacl 
fragments (indicated by letter b and b ). The second genomic DNA fragment 
canied by the vector is also duplicated in the genome {Narl-HinAWly indicated by 
letter a and a'). Genomic DNA was digested with BgUl (which does not cut the 
20 vector) or BcR (which cuts once in fragment a' on the vector) as indicated by 

numbers in a circle. The two recovered plasmids carry sequences that extend the 
right flank only by 170 and 800 bp respectively beyond the genomic DNA on 
p214Sl. 

Figure 6 shows how the targeted integration vector pi 1 8BSP was 
25 constructed. The sites for restriction enzyme digestion on the vectors are 
indicated by arrows, pi 1 8B 14 contains two duplicated Narl-HindlU-Sacl 
fragments (numbered region, a + b = a' + b') but only one is present in pi 18BSP. 
pi 18BSP was linearized with BgRl and transformed into a wild type C. 
heterostrophus strain. 
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Figure 7 depicts the recovery of genomic DNA flanking the 
targeted integration vector pi 18BSP in transformant #9. The vector integrated 
into the target site by a double crossover. Genomic DNA was digested with Pstl 
and relegated. The recovered plasmid p9P2 includes the entire pUClS sequence 
5 on pi 18BSP and 4.6 kb of genomic DNA that contains all of ORFl (CPSl), 
including the stop codon (TAG) and 3.0 kb of genomic region 3' of the stop 
codon. 

Figure 8 shows how the targeted integration vector pi 18BCS was 
constructed. The sites for restriction enzyme digestion are indicated by arrows. 

10 Figure 9 depicts the recoveiy of genomic DNA flanking the 

targeted integration vector pi 18BSP in transformant #12. The vector integrated 
into the target site by a single crossover that resulted in a duplicated Sspl-Sspl 
fragment in the transformant genome (indicated by the letter "a"). Genomic DNA 
was digested with HindlU (arrows) and ligated. The recovered plasmid pl2H6 

1 5 contains the entire pi 1 8BSP sequence and a 2.1 kb genomic DNA (5acl-///>jdIIl) 
on the left region that overlaps with the sequence carried on p9P2 (see Figure 7). 

Figures lOA-B are photographs which show that the REMI mutant 
R.C4.2696 grows like wild type in culture (Figure lOA) and produces normal 
appressoria (Figure lOB). In Figure lOA, plates containing complete medium 

20 (CM) were inoculated with a conidia-bearing mycelium plug and incubated at 
22'C under warm white light (F40/350BL) (Sylvania Inc., Danvers, MA). The 
photograph in Figure lOA was taken 6 days after inoculation. Left to right: 
Mutant R.C4.2696; wild type. In Figure I OB, conidia of mutant (left) or wild type 
(right) were placed in a drop of water and incubated at 32°C for 6 hrs. No 

25 significant difference in percentage of appressorium-forming conidia (arrows) was 
detected. 

Figures 1 1 A-B are photographs which show that the REMI mutant 
R.C4.2696 produces wild type levels of T-toxin (Figure 1 IB) but has reduced 
virulence on T-cytoplasm com (Figure 1 1 A). In Figure 1 1 A, two week old T- 
30 cytoplasm com plants were inoculated with conidial suspensions of (left to right) 
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mutant R.C42696, wild type race T, and a Tox+ mutant progeny from a cross 
between R.C4.2696 and a race O tester, and symptoms (represented by three 
diseased leaves for each strain) were recorded 5 days after inoculation. Leaves 
infected by wild type race T collapsed (anow) but those infected by mutants 
5 remained alive. In Figure 11 B, a plate containing T-toxin-sensitive E. coli cells 
was inoculated with agar blocks bearing mycelia of three strains (in the same 
order as in Figure 11 A) and incubated at 32°C overnight (the inoculum at bottom 
is race O control). No significant differences in T-toxin production (indicated by 
halos) among the three strains were detected. 

1 Q Figure 1 2 is a photograph which displays that the REMl mutant 

R.C4.2696 produces lesions much smaller than wild type on N-cytoplasm com. 
Two week old N-cytoplasm com plants were inoculated with conidial suspensions 
at the same concentration and symptoms were recorded 7 days after inoculation. 
The mutant (right) produces the same number of lesions as wild type (left) but the 

15 size of lesions was dramatically reduced compared to wild type (arrows). 

Figures 13A-B show that the REMl mutant R.C4.2696 has a 60% 
reduction of virulence compared to wild type. In Figure 13A, lengths of 100 
typical lesions from com leaves inoculated with wild type race O and a mutant 
progeny R45 ijox. hygB") carrying the R.C4.2696 mutation were measured 7 
20 days after inoculation and values plotted. Figure 1 38 shows the statistical 

analysis that 86% of the mutant lesions are less than 4 mm in length (average size 
3.5 mm), 60% reduced compared to that of wild type (8.5 mm). 

Figures 14A-B illustrate the genetic analysis showing that a tagged, 
single site mutation is responsible for the mutant phenotype. Figure 14A is a 

25 photograph of a plant assay where N-cytoplasm com was inoculated with parents 
and progeny indicated in Figure 14B (a complete tetrad from the second cross is 
shown here; random spore assay for the two crosses gave the same results). 
Figure 14B is a table which shows progeny segregation data. In both crosses, 
progeny segregated 1 : 1 for parental type only and all hygromycin B resistant 

30 progeny showed the same small lesion phenotype (represented by progeny 1 -4 in 
Figure 14 A) as the mutant parent (parent 1 in Figure 14 A); all hygromycin B 
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sensitive progeny show the large lesion phenotype (represented by progeny 5-6 in 
Figure 14 A) of the wild type parent (parent 2 in Figure 14A). 

Figure 15 is a restriction map of the cloned sequences surrounding 
the tagged site. Al 1 .3 kb genomic region (thick line) was cloned and completely 
5 sequenced. The original REMI insertion point in the mutant R.C4.2696 is 

indicated by a vertical arrow. The asterisks indicate two targeted integration sites 
in the wild type genome. Two open reading frames (in opposite directions), 
ORFl iCPSl, 5.4 kb) and 0RF2 (TESl, l.l kb) are indicated by open boxes 
below the map (the positions of putative introns are indicated by vertical bars). 
- 1 0 Locations of seven overlapping plasmid clones used for sequencing are indicated 
by thin lines on the top of the map (filled triangles represent the vector sequence 
in each clone). Sequencing strategy is indicated by arrows above each clone line. 

Figure 16 graphically depicts the G+C content of the cloned 
sequences surrounding the tagged site. The 1 1.3 kb of genomic DNA sequence 

1 5 (Figure 1 5) was examined as 200 bp fragments and G+C content of each fragment 
was calculated and plotted. Regions A and C have normal G+C content; regions 
B and D show high G+C content. The overall percentage of G+C in each region 
is indicated by underlined numbers. The positions of ORFl (CPSl) and ORF2 
(TESl) are indicated by arrows and the percentage of G+C of each ORF is given 

20 below (in parentheses). 

Figure 17 displays the nucleotide sequence of CPS! . 5,725 base 
pairs from the 1 1 .3 kb sequenced region (Figure 15) are shown. The deduced 
amino acid sequence of CPS 1 protein is given below the DNA sequence. The 
position of start codon ATG (bold and underlined) is designated +1 and the open 

25 reading frame stops at position 5381 (TAG, in bold and underlined). Five putative 
"CAAT" boxes (bold and indicated by asterisks) are found at the positions -36, - 
58, -67, -172 and -309. Three putative introns (in lowercase with 5' and 3' splice 
sequences in bold; branch sites are underlined) are located at positions 2070-21 14, 
3542-3592 and 4197-4249. Conserved core sequences are shaded. TheGXSXG 

30 motif is boxed. A putative polyadenylation signal is located at position 5604 
(bold and overlined). 
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Figure 1 8 displays the nucleotide sequence of TESL 1 ,901 base 
pairs 5' of CPSl are shown. The deduced amino acid sequence of the TESl 
protein is given below the DNA sequence. The CT motif (bold and indicated by 
asterisks) is found at position -46. A "AT"-rich region (bold and italicized) is 

5 found at position -12. The position of ATG start codon (bold and underlined) is 
designated +1 and the open reading frame (transcribed in the opposite direction 
from CPSl) stops at position 1 153 (TGA, in bold and underlined). One putative 
intron (in lowercase, border sequences are bold; branch sites are underlined) is 
located at position 518-566. A putative polyadenylation signal is located at 

10 position 1 345 (bold and overlined). The putative active site (VHS) is shaded. 

Figures 19A-C are schematic representations which show the 
characterization of modular structure of CPSL Peptide synthetase and 
thioesterase are indicated by open boxes; shaded boxes inside indicate functional 
domains and modules; vertical bars in the shaded boxes indicate highly conserved 

1 5 core sequences. Figure 1 9A illustrates the general structure of bacterial and 
fungal peptide synthetases (adapted from Marahiel 1997, which is herein 
incorporated by reference). A peptide synthetase gene cluster is shown on the top. 
There can be one or more amino acid activating module (cyclosporine synthetase 
has 1 1) in each protein; some peptide synthetases have thioesterase domains (TE), 

20 which can be either integrated into modules or encoded by a separate gene. Each 
synthetase can have type I, type II or both modules. A type I (minimal) module is 
enlarged to show organization of core sequences and domains. Some peptide 
synthetases also have condensation or epimerization domains. Figure 19B 
illustrates the organization of saframycin Mxl synthetase containing 4 amino acid 

25 activating modules (Pospiech et. al., "Two Multifunctional Peptide Synthetases 
and an O-methyltransferase are Involved in the Biosynthesis of the DNA-Binding 
Antibiotic and Antitumour Agent Saframycin Mxl from Myxococcus xanthus,'" 
Microbiology . 142(4):74 1-746 (1996)). SafBl from the first module is enlarged. 
Core sequences 1 and 5 in safBl are weakly conserved (indicated by dashed 

30 vertical bars). The remaining domains are typical of type I as shown in Figure 
19 A. Safe is a putative O-methyltransferase. Figure 19C illustrates the 
organization of CPS J. Sequence analysis revealed two amino acid activating 
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modules (CPSl A and CPS IB), both of which have high similarity to safBl except 
that core 2 is weakly conserved. A thioesterase domain is found at the C-terminal 
region of CPSIB. Three vertical an-ows indicate the positions of targeted gene 
disruptions in the wild type genome that yielded the mutant phenotype. TES 1 is a 
5 thioesterase encoded by a separate gene (TESl), 

Figures 20A-F are comparative alignments of core amino acid 
sequences in CPSl A and CPSIB with those of other peptide synthetases. Figures 
20A-E are comparative amino acid sequence alignments of amino-acid-activating 
domains (cores 1-5); Figure 20F is a comparative amino acid sequence alignment 

1 0 of the thioester formation domain (core 6). In each subfigure, the first column 
shows the names of peptide synthetases; the second indicates the position of the 
first residue aligned in the original amino acid sequence of each protein; the last 
column on the right indicates the number of amino acids between two cores 
(Figures 20A-E, in parentheses) or the distance between two adjacent amino-acid- 

1 5 activating modules (Figure 20F, in parentheses). There is an extra column on the 
right in Figure 20F, showing the total number (underlined) of residues in each 
amino-acid-activating module in which the aligned core sequence is located. The 
consensus of each core sequence is on the top, which includes identical or similar 
residues found in all peptide synthetases or with only a few exceptions (active site 

20 is also indicated by asterisks). SafBl : the first module in saframycin Mxl 
synthetase B of Myxococcus xanlhus (Genbank accession U24657); GrsA: 
gramicidin S synthetase A of Bacillus brevis (SWISS PROT accession P14687); 
HTSIA and HTSIB: the first two modules in HC-toxin synthetase of 
Cochliobolus carbonum (Q01886); EsynA and EsynB: two modules in enniatin 

25 synthetase ofFusarium scirpi (EMBL accession Z18755); ACVA and ACVB: the 
first two modules in ACV synthetase of Aspergillus nidulans (SWISS PROT 
PI 9787); CsynA and CsynB: the first two modules in cyclosporine synthetase of 
Tolypocladium nivenm (EMBL Z28383). 

Figure 21 is a comparative alignment of amino acid sequences of 
30 active sites of thioesterase domains (TE) in CPS 1 with those of other peptide 
synthetases. ACV: ACV synthetase (Swiss-PROT accession PI 9787); GrsB: 
gramicidin S synthetase B (PI 4688); GrsT: the thioesterase encoded by grsT 
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(P14686) in gramicidin S synthetase gene cluster; SrfA: surfactin synthetase A-3 
(Q08787); TycC: tyrocidine synthetase C (Genbank accession AF004853); TycF: 
the thioesterase encoded by fycF (AF004853) in the tyrocidine synthetase gene 
cluster. The highly conserved residues (GXSXG) are indicated by asterisks. The 
5 number on the left of each amino acid sequence indicates the original position of 
the first residue; the number on the right (in parentheses) indicates the distance 
between the last residue shown to the end of each protein. 

Figure 22 is a comparative alignment of the amino acid sequence 
of the TESl protein (CH-TESl) with that of other type II thioesterases. HS-TEII: 

10 Homo sapiens thioesterase II (EMBL accession X86032); EC-TESB: E. coU acyl- 
coA thioesterase II (Genbank accession M63308); MT-TESB: Mycobactrium 
tuberculosis homolog to E. coll acyl-coA thioesterase II (EMBL Z95387). The 
identical residues in all four proteins are in bold. The putative active site VHS 
motif is indicated by asterisks. The numbers on the right column indicate the 

1 5 original position of the last residue of the line in each protein sequence. The 
entire protein sequence of each TES was aligned using the Jotun Hein Method. 
Amino acids corresponding to the positions 142-171, 236-265 and 356-367 in CH- 
TESl have no significant similarity among the four proteins. 

Figures 23A-B are photographs displaying a plant assay (Figure 
20 23A) and a gel blot (Figure 23B). Targeted gene disruption suggests that CPSl is 
involved in fungal pathogenesis. In Figure 23 A, N-cytoplasm corn was inoculated 
with (left to right): The REMI mutant R.C4.2696; wild type race T; wild type race 
O and five disruptants obtained using the linearized p214B7 as vector. All 
disruptants give the small lesions similar to the original mutant. In Figure 23B, 
25 total genomic DNA was digested with Bglil and probed with both 5' and 3' end 
flanking DNA fragments carried on p214B7 separately (strain order is the same as 
above). A single band (4.2 kb) is present in both wild type race T and race O but 
replaced by a 9.3 kb band (increased by the size of the vector, 5.1 kb) in all strains 
that showed the mutant phenotype in Figure 23A. 

30 Figure 24 displays a gel blot analysis showing targeted integration 

of the chromosome walking vector p214SNP into the wild type genome. Lane 1: 
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wild type race O. Lanes 2-8: seven transformants obtained by transforming wild 
type race 0 (strain C5) with p214SNP. Genomic DNAs were digested with Bglll 
and probed with 3.2 kb flanking genomic DNA fragment cloned on p214Sl which 
gives three bands in wild type (4.2, 2.1 and 0.5 kb, indicated by arrows on the 

5 left). The targeted site is in the 2. 1 kb fragment which is missing in all disruptants 
(indicated by an arrow on the right). Other two bands are intact as predicted. All 
disruptants showed the same mutant phenotype in the plant assay as shown in 
Figure 23. Five disruptants (lanes 2, 3, 5, 7 and 8) resulted from a single 
crossover integration. Integration in two other disruptants (lanes 4 and 6) has not 

10 been determined. One of the disruptants (#118, lane 2) was used to recover the 
plasmid pi 18B14 and pi 18BC4. 

Figure 25 displays a gel blot analysis showing targeted integration 
of the chromosome walking vector pllSBSP into the wild type genome. Lane 1 : 
wild type race 0; Lane 2: wild type race T; Lanes 3, 4 and 5: three transformants 
obtained by transforming wild type race O (strain C5) with pi 1 8BSP. Genomic 
DNAs were digested with Pst I (which cuts pUCATPH) and probed with the 3.2 
kb flanking genomic DNA fragment cloned on p214Sl which gives two bands in 
wild type (6.6 and 2.5 kb, indicated by arrows on the left). The targeted site is in 
the 6.6 kb region which is missing in all three disruptants (indicated by arrows on 
the right). The 2.5 kb band is intact as predicted. The third band (part of vector 
plus 4.6 kb genomic DNA) does not hybridize to the probe. All disruptants 
showed the same mutant phenotype in the plant assay as shovm in Figure 23. Two 
disruptants (lanes 3 and 4) resulted from a double crossover integration. 
Integration in the third disruptant (lane 5) has not been determined. One of the 
disruptants (#9, lane 3) was used to recover the plasmid p9P2. 

Figure 26 displays a gel blot showing the detection ofCPSI 
homologs in C victoriae and C carbonum. Genomic DNAs were digested with 
Bgh\ and probed with the 3.4 kb CPSl fragment cloned on p214B7 (Figure 2) 
which includes most of the 4.2 kb BgRl fragment of CPSl (Figure 15). Lanes 1 
30 and 2: C heterostrophus race T (C4) and race 0 (C5), both of which hybridized to 
• the 4.2 kb fragment. Lanes 3: C. victoriae (Hvw). Lanes 4, 5 and 6: C carbonum 
race 1 (26R13), race 2 (YugY) and race 3 (BZ1209). Note that both C victoriae 
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and C carbonum (three races) hybridized to a 5.0 kb fragment and the 
hybridization signals of all three species showed about the same intensity. 

Figure 27 is a high-performance liquid chromatography (HPLC) 
profile of culture extracts from wild type C victor iae (HvW, top left) and three 
5 transformants (Tx7, Tx2 and Tx9). The major peak for victorin C is indicated by 
arrowheads. No significant differences in victorin production were detected 
between the wild type and transformants. Three other transformants (Tx4, Tx5 
and Tx8) gave the same results. 

Figure 28A-B are photographs which display a plant assay (Figure 

1 0 28A) and a gel blot (Figure 28B), showing targeted disruption of the CPSl 

homolog in C victoriae. Figure 28A shows C. victoriae transformants (Tx) with 
reduced or wild type pathogenicity to susceptible oats. Oat seeds were inoculated 
with conidial suspensions of (left to right) wild type, Tx7, Tx2 and Tx9 (the last 
pot on the right is the uninoculated control). Two transformants (Tx7 and Tx2) 

1 5 showed dramatically reduced pathogenicity as indicated by the substantial growth 
of the oat plants. These two transformants resulted from a homologous 
integration (lanes 2 and 3 in Figure 28B). Tx9, which killed all oat plants as wild 
type, resulted from a ectopic integration (lane 4 in Figure 283). All three 
transformants produced wild type level of victorin as determined by HPLC 

20 analysis (Figure 27). Figure 28B shows disruption of CPSl homolog in the wild 
type genome. Genomic DNAs were digested with Bglil and probed with the 3.2 
kb CPSl fragment {Kpnl-Sacl) cloned on p214Sl (Figure 2) which hybridized to 
two fragments (4.2 and 2.2 kb, see Figure 15) in C heterostrophus (lane 1), but to 
three fragments (5.0, 1.8 and 0.2 kb) in the C. victoriae (lane 2). In Tx7 (lane 3) 

25 and Tx2 (lane 4), one or two of the wild type fragments was replaced by a larger 
fragment (8.0 kb for Tx7 and 9.4 kb for Tx2) containing the transforming vector. 
The actual size increase by the vector integration can not be predicted because the 
presence of polymorphic bands in C. victoriae genome and the presence of 
duplicated CPSl fragments on the transforming vector (Figure 5), In Tx9 (lane 

30 5), which caused wild type symptoms shown in Figure 28A, all three wild type 
bands (indicated by an-ows) are intact, confirming an ectopic integration. 
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Figure 29 shows the REMI vector pUCATPH. This vector was 
constructed by insertion of a 2.4 kb Sail fragment containing the selectable marker 
cassette (the largest arrow) from pDH25 (Cullen et al., 1987) into the Sail site of 
polylinker site of pUC18 (between lad and lacZ). Only six-base-pair restriction 

5 enzyme sites are shown on the map. Italicized sites are unique; three of them, 
////idlll, Kpn\, and Sad (bold) have been used for REMI transformation. Non- 
cutting enzymes are listed below the map (enzymes that recognize six-base-pair 
sites are underlined), amp, Ampicillin resistance gene; hygB, hygromycin B 
resistance gene; PtrpC, A. nidulans trpC promoter; TtrpC, A. nidulans trpC 

10 terminator; ori, Escherichia coli origin of replication. 

Figures 30A-30C are photographs of DNA gel blots showing 
DNA-DNA hybridization of ChCPSl to other fungal genera and species. In 
Figure 30A, the gel was loaded with Cochliobolus species (lanes 1-17) as follows: 
C. heterostrophs race T, race O; C. carbonum race 1, race 2; C. victoriae isolates 

15 F13, HvW; C. bicolor, C. dactyloctenii, C. chloridis, C. homomorphus, C. 

intermedius, C. melinidis, C. melinidis, C. peregianensis. C. perotidis, C. ravenelii 
and C. sativus. Figure 30B is a photograph of a DNA gel blot from a gel loaded 
with other Ascomycete genera (lanes 1-14) as follows: C carbonum racel 
(control), Setosphaeria rostrata, Stemphyllium spp.. Pyrenophora tritici repentis, 

20 Bipolaris sacchari, Alternaria spp., A. solani, Neclria haematococca, Fusarium 
oxysporum. Glomerella spp. Magnaporlhe grisea, F. moniliforme, F. moniliforme 
(repeat) and A. solani (repeat). Figure 30C is is a photograph of a DNA gel blot 
from a gel comparing Candida albicans to C. heterostrophus and closely related 
species (lanes 1-7): C. heterostrophs race T, Bipolaris sacchari. Setosphaeria 

25 rostrata, Stemphyllium spp., Pyrenophora tritici repentis, Alternaria spp. and 
Candida a/f>/can5(arrowhead). Genomic DNAs were digested with Hindm (A, 
lanes 1-17; B, lanes I-11;C, lanes 1-7), J^ol (B. lanes 12and 14)or5gm (B, 
lane 13) and probed with the 3.2 kb fragment oiCPSl from p214Sl (Fig. 2) at 
high stringency. Weak signals in lanes 3 and 17 (panel A) are due to insufficient 

3 0 DNA loading (confirmed by a repeat experiment) . 

Figure 31 A is a structural comparison of the ioMT,CPSl homologs 
to ChCPSl. ORFs are indicated by the open boxes; shaded boxes inside indicate 
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functional domains; vertical bars indicate conserved motif sequences found in 
nonribosomal peptide synthetases (NRPS) as defined by Stachelhaus and Marahiel 
(Stachelhaus and Marahiel, 1995; Marahiel, 1997) (dashed bars indicate weak 
conservation). The black bulbs indicate the position of putative introns. Cores 1-5: 

5 adenylation; core 6: thioaltion; TE: thioesterase. The distance between core 
sequences is not drawn in exact scale. The name of proteins is on the left of the 
ORF boxes and the number of amino acids on the right. The unidentified regions 
of AsCPSJ and PtCPSJ are indicated by dash-lined boxes. The similarity to 
ChCPSl (in the overlapping region only, see text for details) is given in the 

10 parentheses under the protein names in the order: nucleotide identity/ amino acid 
identity/ amino acid similarity. The positions of the ChCPSl amino acid 1040 is 
indicated by the open arrow; the positions 511 and 1269 (to the first and the last 
amino acids of AsCPSl and PtCPSl) are indicated by filled triangles. Figure 3 IB 
is an amino acid alignment of the four CPSl homologs to ChCPSl. 530 amino 

1 5 acids aligned to the amino acids 51 1-1040 of ChCPSl(shown in A) are shown. 
The identical residues are in uppercase and the similar residues in lowercase. 
Consensus of sequences similar to the typical NRPS signature motifs is 
underlined. The putative cyclization domain motif "DXXXXD/ EXXS/ A" is 
underlined. 

20 Figure 32 is the nucleotide sequence of FgCPSL 6,003 base pairs 

cloned using the plasmid rescue procedure are shown. The amino acid sequence of 
FgCPSl protein is given below the DNA sequence. The position of the start codon 
ATG (bold and underlined) is designated +1 and the open reading frame stops at 
position 5123 (TGA, bold and underlined). A "CP' motif (italicized and 

25 underlined) and two putative "CAAT" boxes (bold with asterisks) are found at 

positions -30, -204 and -302. A putative intron (in lowercase with 5' and 3' splice 
sequences in bold; branch sites underlined) is located at positions 4245-4290. 
Conserved core sequences are shaded and the putative cyclization domain motif 
"DXXXXEXXA" (position 2323-2346) is underlined. A putative polyadenylation 

30 signal "AATAA" at position 5 1 88 is bold and overlined. 

Figure 33 shows the nucleotide sequence of AsCPSL 2,369 base 
pairs amplified by PCR are shown. The amino acid sequence of AsCPSl protein is 
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given below the DNA sequence. The sequence is not complete. Two putative 
introns (in lowercase with 5' and 3' splice sequences in bold; branch sites 
underlined) are located at positions 540-584 and 2012-2059. Conserved core 
sequences are shaded and the putative cyclization domain motif "DXXXXDXXS" 
5 (positions 694-720 ) is underlined. The PGR primer binding sites at the 5' and 3' 
end are underlined. 

Figure 34 shows the nucleotide sequence of PtCPSL 2,320 base 
pairs amplified by PGR are shown. The amino acid sequence of PtGPSl protein is 
given below the DNA sequence. The sequence is not complete. A putative intron 
10 (in lowercase with 5' and 3' splice sequences in bold; branch sites underlined) is 
located at positions 540-583. Gonserved core sequences are shaded and the 
putative cyclization domain motif "DXXXXDXXS" (positions 693-719) is 
underlined. The PGR primer binding sites at the 5' and 3' end are underlined. 

Figure 35A is a photograph of a DNA gel blot showing that the 
1 5 2.2 kb wild type band (arrowhead) is disrupted in homologous transformants 

TxFgG8-4, -10, -1 1 and -5 but is intact in the ectopic transformant TxFgG8-Hl, - 
H2and-Bl [generated using ///wt//// (H) or fig///(B)-digested pFgG8-hygB]. 
Genomic DNAs were digested with Clal and probed with pFgG8 which carries a 
1.0 kb FgCPSI fragment. Figure 35B is a photograph of plants from a virulence 
20 assay showing F. graminearum transformants (Tx) with reduced or wild type 
virulence to wheat. Wheat heads were inoculated with conidial suspensions 
{\(flm\) of (left to right) wild type, TxFgC8-4, -10, -HI, and -1 1 or with water 
only. Photograph was taken 7 days after inoculation. Note that most spikelets of 
wheat heads inoculated with homologous transformants looked "healthy" in 
25 contrast to those inoculated with ectopic transformants that were completely 
"bleached'* (indistinguishable from wild type). 

DETAILED DESCRIPTION OF THE INVENTION 

The present invention generally relates to an isolated DNA 
molecule from a plant pathogen encoding a GPSl peptide synthetase. In one 
30 embodiment, the DNA molecule has a nucleotide sequence which hybridizes to a 
DNA molecule having a sequence corresponding to SEQ. ID. No. 2 as follows: 
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TGCCTGCGCC TGTGCTTGTG CCTGTGGAAT 
TACATACAAC ACCATCCCAT CCCGCTTCAC 
CCGCCGCCCA CAACACCATG GCTGCXjACCA 
TGGACCACGA GCTCGAGGAG GGCGATATTA 
5 ACCGCCATCC GCCTCTCTGC GTGCGCTAAT 
GTGCTGCTGT CGCAGTATCT AGGGCCTGAC 
CAGCAGAACC CACCCCAACC ATCCAGTGAG 
ATTCCGTCCG GTCCGAGTCC ATCACNGCGA 
CCCGACTCAT ACCATGACGC TTCCGCACAG 
10 AACGCCTCCG CCGCTGCCTC GGGGGGCTCG 
GGCCGTTTTC AAGAGAAGCA GCTGGGCCTG 
CAGCTGTCGC AAGGAAGCGA GACGTTCATT 
TCGCGCGAGC CCACCATGAT GGGCAACTAC 
GATGGCCAAT TTGGCTCTCC GGGAGAGGCC 
15 CAGGGTTATT TTTCCGACTT CACAGGCCAG 
GGACCCAACC GCTACTCGTC GGGAGATGCC 
ATGATGAACC CCAACGATCT CCCCTTGGGC 
CGCGATCTGC CTTTTGACX3T TTACGACCCT 
GACAACATTG GCGCTGTCTT GCGTCACCGA 
20 TGGGTCCTTG ACGCAAAAGG CAAAGAGACG 
CGCGCGGAAA AGGTGGCCAA AGTGATTCGG 
GTGGCATTAG TGTACAGGGA TACAGAAATC 
TTCATTGCGG GCGTTGTAGC GGTACCCATC 
CTTCTCCTAA CGACAACTCA AGCTCATCTC 
25 TTTCATCGTG ACATTAGTCA GAACCGTCTG 
ACGAACGAGT TTGGCAGCCA CCACCCCAAG 
CCAGAGGTTG CCTATATTGA GTTCTCGCGT 
CTTAGTCACC GGACTATTAT GCACCAAATG 
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GTCGCGGCCC GCTGCTGCAT AGCCTATCTG 60 

CTGCCTTGCC TCCCTCCTCG TGCCACACAT 120 

ACCCCGAGCT GCAGGCCAAA CTGCAGGAGC 180 

PVCAAAAAGG GTCCGTACTG CTGCACCACC 24 0 

CAGTCGCATA GCTATGAAAA ACGTCGCACC 300 

TTTGCTGCCC AGTTGCAGGC CGACCTGAAC 360 

GGCTCTCGCT CCCGCACCGC ATCCTTTGCT 420 

CCACAACCCC CACATATCCA GCTCCCCCGC 480 

GGCCAATTGG GCGCACCCAT GCCATATGCG 540 

CAGTACATGG CATACCCGCC CAGCCAAGTC 600 

CGTACAAATT CGCTCCAGCG CAATTCCTCA 660 

CCACGGCCTC AAACGCCTGA ATACAACCAC 720 

GCCTTCAATC CAGACAATCA GCAAAGTTAT 780 

AGTCGAAGGA GCACCATGCT CGAGGTAAAC 840 

CAGATGCAAG ACAATCGCGA CTCGTATGGG 900 

TTTTCTCCTA CCGCCGCGAT TCCACCTCCC 960 

GCTGCTGAAA CCATGATGCC GCTAGAGCCC 10 20 

CACAACCCCA ATGTCAAAAT GTCAAAGTTT 1080 

AGTCGCACAC AGCCAAGGAC GACTGCCTTC 1140 

GCGTCCATCA CCTGGGAAAA GGTGGCTAGT 12 OO 

GACAAGAGCA ACCTCTATCG AGGCGACCGT 1260 

ATTGATTTTG TCGTGGCGTT GATGGGCTGC 1320 

AATAGCGTCG ACGACTACCA GAAACTCATT 13 80 

GCATTGACCA CAGACAACAA TCTCAAGGCC 144 0 

AAATGGCCGA GTGGGGTAGA GTGGTGGAAG 15 OO 

AAACATGACG ATACTCCAGC TTTGCAAGTA 1560 

GCACCTACTG GTGACCTTCG CGGTGTGGTG 1620 

GCCTGCATCA GTGCCATGAT TAGCACGATA 1680 
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CCCACCAACG CTCAGAGCCA AGACACGTTC 
TTCGTTGCTC CAGCACCGTC CAGAAACCCC 
CX3CGAAAGCG CTGGTCTCAT TCTCAGTGTC 
GTATGGCTCG AGACAGCGAC CATGGAAACC 
5 TACAAGTCCA ACATACTGCT AGCGGATTAC 
CAACAGGATC CAATGGCTAC AAGAAACTTC 
GTGAAGATCT GTCTGATTGA CACGCTTACC 
GATCGATATT TCAGGCCACT GCGAAACCCT 
TGCTTGCCAG AACATGGTGG AATGATAATA 
10 CGCATGGGCT GCCCGCTAAG CATAGCAGTA 
GAGGATAAGT ATGCAGCGGC AAATGGCTAC 
AACAAAAAGG AGAAGAAGAA GAAAGGCCCG 
GAAGCTCTGA AGATGAACGA AGTCATTGTT 
GCAAACGAGC CCGGCACCAT GCGAGTCGGT 
15 CTAGCTATTG TAGACCCTGA GACAAGTCTT 
TGGGTAGATT CGCCTTCACT CTCTGGTGGC 
ATTTTCCATG CTCGACCATA CCGTTTCGTT 
CTCGAGTTTC TGCGTACTGG ACTCCTCGGC 
GGACTGTACG AAGATCGCAT CAGACAGCGT 
20 GCCGAGCATC GATACTTTTT TGTGCAGCAC 
AAAATTTACG ACTGGTAAGT GAGCTGCCAA 
TCGTCGTTTG ATTCTTATGT AAATGGTGAA 
GCCGCATCGA CTGCGCCCAC AAACCCAGGT 
TTGGATTCAC TATCTGAGAG GTGCATGGAG 
25 TACTGCGTGA TGATTACAGC ACCTAATACA 
GAAATTGGCA ATATGCTGTG TANGAGANAG 
GTNAAGTTTG GCATTGAGCG ATCAGTGCAG 
GGCATGTGGT CATTTGAGGC ATCAATGGCA 



AGCACTAGCC TACGGGATGC AGAGGGAAAG 1740 

ACAGAAGTGA TCCTCACGTA CCTCGACCCG 1800 

TTGTTTGCAG TTTATGGAGG CCACACCACC 1860 

CCGGGTCTAT ATGCACATCT CATCACCAAA 1920 

CCAGGCCTCA AGCGCGCTGC ATACAACTAC 1980 

AAGAAAAACA CAGAACCCAA CTTCGCCTCC 2040 

GTCGACTGTG AATTTCACGA AATTCTCGGA 2100 

AGAGCGCGAG AACTGATCGC GCCAATGCTC 2160 

TCTGTACGCG ACTGGCTAGG TGGAGAGGAG 2220 

GAAGAGTCAG ATAATGATGA AGATGATACA 2280 

TCCAGTCTTA TTGGTGGTGG CACTACAAAG 2340 

ACAGAGCTTA CAGAAATCTT GCTGGACAAG 24 00 

CTGGCCATTG GAGAAGAAGC AAGCAAGCGG 2460 

GCCTTTGGAT ACCCCATACC GGATGCGACA 2520 

CTATGTTCAC CATACTCGAT AGGCGAGATC 2580 

TTCTGGCAGC TGCAGAAGCA TACAGAGACC 2640 

GANGGTAGCC CTACGCCACA GTTGCTTGAA 2700 

TTTGTTGTAG ANGGAAAAAT ATTTGTCCTT 2760 

GTTGAATGGG TAGAAAATGG TCAGCTTGAA 2820 

CTGGTCACAA GCATTATGAA GGCCGTGCCA 2880 

CAGAGCAAGG ACTGTCTAAC GTGTCATAGC 2940 

TACCTGCCAA TCATTCTCAT CGAGACGCAG 3000 

GGACCACCAC AACAATTGGA TATACCATTT 3060 

GTCCTTTACC AAGAGCATCA TTTACGGGTA 3120 

CTTCCACGAG TCATCAAGAA CGGACGGCGA 3180 

TTTGACAATG GCTCTCTGCC CTGTGTNCAC 324 0 

AACATTGCGC TCGGTGACGA TCCCGCTGGC 3300 

CGTCAGCAAT TCTTGATGCT CCAAGACAAG 3 360 
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CAATACTCTG GTGTCGATCA TCGCGAAGTC 
AATCAGTTCT CGAATATCCA CGACCTGATG 
CTTGCTTACT GCACTGTCGA CGGTCGAGGA 
TTTGATCAAA AGGTTGCGGG CGTAGCAATG 
5 GGCGATCATC TCCTTCTGAT GTACACGCAT 
TGTTTTGTGC TTGGAGCTGT TTGCATACCA 
GAGGATGCGC CGGCCTTGCT GCATATCCTT 
AACGCTGACG TTGACCATCT GATGAAGATC 
GCCGCTATCC-TCAAGATCAG TGTGCCAAAC 

10 TCCAGTGGCT GCCGCX3ACCT CAAGCTTACA 
CCAGTGCTAG TCTGGACATA CTGGACGCCC 
CATAGCCAAA TCATGGCACT GTGCAAGGTC 
CGACCAGTCC TTGGTTGTGT CCGGAGCACG 
ATGGGAATCT TCCTTGCCGC ACCCACATAC 

15 CCTAATATTC TGTTCCAAAC GCTTTCGCGG 
CAAATGTTGG ACCACGCCAT CGCACGCGGA 
AAGAATCTCA TGATTGCGAC TGATGGAAGA 
GTATGAGAGG ACTTTCATGA TTGCTAACTC 
GCGCCAGCCA ACTTAGACCC AACCGCAATC 

20 ATGGTAGCAT CACGATCATA CATGTGTATT 
GCTCTGCGAC GCGGCCTCGT CATGCCCGTT 
GTCCAAGACT CGGGCATGGT GCCAGTGAGC 
AACCAACTGT GCTTGAACGG CGAGTACGGC 
TATAGCTTCT ACATGTCGAA AGAGCGCTTG 

25 GACGGAGACC CAAATGTGCG ATATGTTCGT 
ACACGGCCCA TTGGACCCAA CGGTGCACCT 
AGCATAGGTG ACACTTTTGA AGTCAACGGA 
TCTGTTGAAC GTTGTCACCG GAATATTGTC 



GTCATTGACG ACAGGACATC GACTCCACTC 3420 

CAATGGCGTG TATCTCGGCA GGCCGAGGAA 3480 

AAAGAGGGCA AAGGCGTCAA TTGGAAGAAG 3540 

TACCTCAAGA ACAAGGTCAA GGTCCAGGCC 3600 

TCAGAAGAAT TTGTTTATGC TGTTCATGCA 3660 

ATGGCGCCAA TTGATCAGAA CCGGTTGAAT 3720 

GCAGATTTCA AGGTCAAAGC CATTCTTGTC 3780 

AAGCAAGTAT CXSCAGCACAT CAAACAATCG 3840 

ACATACAGCA CAACAAAGCC GCCAAAGCAA 3900 

ATTCGACCGG CATGGATTCA GGCGGGTTTC 396 0 

GATCAACGTC GTATCGCAGT TCAGCTGGGC 402 0 

CAAAAAGAAA CATGCCAAAT GACAAGTACA 4 08 0 

ATAGGACTTG GTTTCCTTCA CACTTGTCTC 414 0 

CTGGTGTCAC CTGTTGACTT TGCACAAAAC 4200 

TACAAGATCA AGGATGCATA TGCAACGAGT 426 0 

GCTGGTAAGA GTATGGCTCT GCACGAGCTG 4320 

CCACGCGTTG ATGTTTGTAA GTGAACATTT 4380 

AATGCAGACC AAAGAGTGCG TGTGCACTTT 444 0 

AACACTGTCT ACTCACATGT ATTGAACCCA 450 0 

GAGCCAGTCG AGCTCCATCT CGATGTGCAT 4560 

GACCCTGACA CAGAGCCCAA CGCTTTGCTC 4620 

ACGCAAATAT CCATTGTCAA CCCAGAGACC 4680 

GAGATCTGGG TGCAGTCCGA GGCGAATGCT 4740 

GATGCAGAAC GCTTCAATGG GAGGACGATT 4 800 

ACAGGCGATT TAGGATTTTT GCACAGCGTG 4 860 

GTTGATATGC AGGTGCTTTT CGTGCTTGGA 4920 

CTGAACCATT TCTCTATGGA CATTGAGCAG 4 980 

CCTGGAGGCT GGTACGTTTC TTCGATTCGC 504 0 
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TGTTATTTAG TAAATACTTA CTAACACTCT ACAGTGCTGT TTTCCAGGCA GGTGGGCTTG 5100 

TTGTTGTCGT TGTGGAAATC TTCCGACGCA ACTTCCTCGC AAGCATGGTG CCTGTGATTG 5160 

TCAATGCAAT TTTGAACGAG CATCAGCTGG TCATTGACAT TGTCTCGTTT GTGCAAAAGG 5220 

GCGACTTCCA CCGGTCTCGT CTGGGCGAGA AGCAACGCGG AAAGATTCTT GCAGGATGGG 52 80 

5 TCACACGGAA GATGCGCACA ATAGCCCAGT ACAGTATACG GGATCCTAAT GGACAGGATT 5340 

CCCAGATGAT CACGGAAGAG CCTGGTCCAC GGGCTAGATG ACTGGAAGTA TGCTTGGGCG 54 00 

AATGGGCGGC CCAGCCAGTA TCAAGGCCGG GTCGACAAGA GCACCGAGTC TAATGGGCAT 54 60 

GACAGCGACT ATGAATAATC TATCCCTTAC ACAGCAGCAA CAGCAGCAAT ACCAACAGCC 5520 

GGGTATGTAT GCTCAACAGC AAGGCATGCA- CCCCCAGCAA CAACACCAAT TTAGCATGTC 5580 

10 CAACACGCCA CCACAAGGTC CACCCCAAGG CGTAGAACTA CATGATCCTA GCGACCGCAC 5640 

ACCAACAGAC AACCGGCACT CTTTCCTTGC CGACCCGCGT ATGCAGAACC AGGGCCAAAT 570O 

GAACGAGACG GGCGCCTACG AACCCATGAA CTATCAAAAC GCGTATCATC CGCATCAACA 5760 

ACAATACGAA TCTGAAGACG GGGGGAGCAG ACTCAGCGGC CCCGTGCCAG ACGTGCTGCG 58 20 

GCCGGGTCCT TCATCCGGGT CCATAGAGCA GCACGACCAA GCTAACAACG ACAACAATAT 58 80 

15 GTGGAATAAT CGCGAGTACT ATGGTAACAG CCCATCGTAT GCAGGCGGAT ACACGCAAGA 594 0 

TGGCAATATC CACGAGCAGC AACAACACGA TGAGTACACG AGTAATGCGT CATATGGCGG 6000 

AAATCAAGGA GCAGGCGGAG GCAGCGGCGG CGGTGGCGGT CTCCGAGTTG CAAATCGTGA 6060 

CAGCTCCGAC AGCGAGGGTG CAGATGACGA CGCTTGGAGA CGTGATGCCC TTGCTCAGAT 6120 

CAATTTTGCG GGCGGCGCTG CTGCTGCCTC CGCTGGAGCA CCTGCTGCTG GTGCTTCTTC 6180 

20 TTCGCAGCCG GGCCATGCGC AGTAGACGGG ATATGCGTGA GTTTTTTTTT AAATTTCGTA 624 0 

CATAGAGACC GTTGTATACG CAGGTTTCAA ATTAGAAGAG CGAATATGCA TATCAGCTGT 6300 

TGTTCAATGT TCTAGTTTGG GAAGGTTAAC CCCCCCCCCT TCCCCTTCCA AGACTTTTCA 6360 

CTTGTTTGTG TGTGATTTAA ATCTGGAGAT TTCAAATCTA CATCTCGCTA TACATAGGTG 6420 

TTGTTTGATA ACGTAGGGGG CAGAAGGGTA TCTCGTGATA TTAGACTGGG AGTTGCATGA 64 80 

25 ATCAAGGTGT TGAGCAAAAA AAGAGAGAGC GGTGAAGGGC GGGGGGGATA GGTGGTGTGC 654 0 

ACGTGGCTG 654 9 

In a preferred embodiment, the plant pathogen is Cochliobolus 
heterostrophus. In another preferred embodiment, the plant pathogen is 
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Cochliobolus carbonum, C victoriae, C sativus, C. specifer, C. homomorphus, C 
dactyloctenil Seiosphaeria turcica, S. rostrata, or Bioplaris saccharL 



The peptide synthetase of SEQ. ID. No. 2 has a deduced amino 
acid sequence corresponding to SEQ. ID. No. 3 as follows: 
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Pro Gly Leu Lys Arg Ala Ala Tyr Asn Tyr Gin Gin Asp Pro Met 
Ala Thr Arg Asn Phe Lys Lys Asn Thr Glu Pro Asn Phe Ala Ser 
val Lys lie Cys Leu He Asp Thr Leu Thr Val Asp Cys Glu Phe 
His Glu lie Leu Gly Asp Arg Tyr Phe Arg Pro Leu Arg Asn Pro 
5 Arg Ala Arg Glu Leu lie Ala Pro Met Leu Cys Leu Pro Glu His 
Gly Gly Met He He Ser Val Arg Asp Trp Leu Gly Gly Glu Glu 
Arg Met Gly Cys Pro Leu Ser He Ala Val Glu Glu Ser Asp Asn 
Asp Glu Asp Asp Thr Glu Asp Lys Tyr Ala Ala Ala Asn Gly Tyr 
ser ser Leu He Gly Gly Gly Thr Thr Lys Asn Lys Lys Glu Lys 
10 - Lys Lys Lys Gly Pro Thr Glu Leu Thr Glu He Leu Leu Asp Lys 
Glu Ala Leu Lys Met Asn Glu Val He Val Leu Ala He Gly Glu 
Glu Ala ser Lys Arg Ala Asn Glu Pro Gly Thr Met Arg Val Gly 
Ala Phe Gly Tyr Pro He Pro Asp Ala Thr Leu Ala He Val Asp 
Pro Glu Thr Ser Leu Leu Cys Ser Pro Tyr Ser He Gly Glu He 
15 Trp Val Asp ser Pro Ser Leu Ser Gly Gly Phe Trp Gin Leu Gin 
Lys His Thr Glu Thr He Phe His Ala Arg Pro Tyr Arg Phe Val 
Xaa Gly Ser Pro Thr Pro Gin Leu Leu Glu Leu Glu Phe Leu Arg 
Thr Gly Leu Leu Gly Phe Val Val Glu Gly Lys He Phe Val Leu 
Gly Leu Tyr Glu Asp Arg He Arg Gin Arg Val Glu Trp Val Glu 
20 Asn Gly Gin Leu Glu Ala Glu His Arg Tyr Phe Phe Val Gin His 
Leu val Thr Ser He Met Lys Ala Val Pro Lys He Tyr Asp Cys 
ser Ser Phe Asp Ser Tyr Val Asn Gly Glu Tyr Leu Pro He He 
Leu He Glu Thr Gin Ala Ala Ser Thr Ala Pro Thr Asn Pro Gly 720 
Gly Pro Pro Gin Gin Leu Asp He Pro Phe Leu Asp Ser Leu Ser 
25 Glu Arg Cys Met Glu Val Leu Tyr Gin Glu His His Leu Arg Val 
Tyr Cys Val Met He Thr Ala Pro Asn Thr Leu Pro Arg Val He 
Lys Asn Gly Arg Arg Glu He Gly Asn Met Leu Cys Arg Arg Glu 
Phe Asp Asn Gly Ser Leu Pro Cys Val His Val Lys Phe Gly He 
Glu Arg ser Val Gin Asn He Ala Leu Gly Asp Asp Pro Ala Gly 
30 Gly Met Trp Ser Phe Glu Ala Ser Met Ala Arg Gin Gin Phe Leu 
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735 
750 
765 
780 
795 
810 
825 



SUBSTITUTE SHEET (RULE 26) 



wo 01/38489 



PCT/USOO/32227 



Met Leu Gin Asp Lys 
Val lie Asp Asp Arg 
He His Asp Leu Met 
Leu Ala T/T Cys Thr 
5 Val Asn Trp Lys Lys 
Tyr Leu Lys Asn Lys 
Leu Met Tyr Thr His 
Cys Phe Val Leu Gly 
Gin Asn Arg Leu Asn 
10 Ala Asp Phe Lys Val 
His Leu Met Lys He 
Ala Ala He Leu Lys 
Lys Pro Pro Lys Gin 
He Arg Pro Ala Trp 
15 Thr Tyr Trp Thr Pro 
His Ser Gin He Met 
Gin Met Thr Ser Thr 
He Gly Leu Gly Phe 
Ala Ala Pro Thr Tyr 
20 Pro Asn He Leu Phe 
Ala Tyr Ala Thr Ser 
Ala Gly Lys Ser Met 
Ala Thr Asp Gly Arg 
Val His Phe Ala Pro 
25 Val Tyr Ser His Val 
Met Cys He Glu Pro 
Arg Arg Gly Leu Val 
Ala Leu Leu Val Gin 
He Ser He Val Asn 
30 Glu Tyr Gly Glu He 
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Gln Tyr Ser Gly Val Asp 
Thr Ser Thr Pro Leu Asn 
Gin Trp Arg Val Ser Arg 
Val Asp Gly Arg Gly Lys 
Phe Asp Gin Lys Val Ala 
Val Lys Val Gin Ala Gly 
Ser Glu Glu Phe Val Tyr 
Ala Val Cys He Pro Met 
Glu Asp Ala Pro Ala Leu 
Lys Ala He Leu Val Asn 
Lys Gin Val Ser Gin His 
He Ser Val Pro Asn Thr 
Ser Ser Gly Cys Arg Asp 
He Gin Ala Gly Phe Pro 
Asp Gin Arg Arg He Ala 
Ala Leu Cys Lys Val Gin 
Arg Pro Val Leu Gly Cys 
Leu His Thr Cys Leu Met 
Leu Val Ser Pro Val Asp 
Gin Thr Leu Ser Arg Tyr 
Gin Met Leu Asp His Ala 
Ala Leu His Glu Leu Lys 
Pro Arg Val Asp Val Tyr 
Ala Asn Leu Asp Pro Thr 
Leu Asn Pro Met Val Ala 
Val Glu Leu His Leu Asp 
Met Pro Val Asp Pro Asp 
Asp Ser Gly Met Val Pro 
Pro Glu Thr Asn Gin Leu 
Trp Val Gin Ser Glu Ala 
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He Ala Arg Gly 
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Gin Arg Val Arg 
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Val 
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Thr 
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Phe Tyr Met Ser Lys Glu Arg Leu 
Arg Thr He Asp Gly Asp Pro Asn 
Asp Leu Gly Phe Leu His Ser Val 
Gly Ala Pro Val Asp Met Gin Val 
5 Gly Asp Thr Phe Glu Val Asn Gly 
He Glu Gin Ser Val Glu Arg Cys 
Gly Cys Ala Val Phe Gin Ala Gly 
Glu He Phe Arg Arg Asn Phe Leu 
Val Asn Ala He Leu Asn Glu His 
10 Ser Phe Val Gin Lys Gly Asp Phe 
Lys Gin Arg Gly Lys He Leu Ala 
Arg Thr He Ala Gin Tyr Ser He 
Ser Gin Met He Thr Glu Glu Pro 
Gly Ser Met Leu Gly Arg Met Gly 
15 Gly Ser Thr Arg Ala Pro Ser Leu 
Asn Asn Leu Ser Leu Thr Gin Gin 
Pro Gly Met Tyr Ala Gin Gin Gin 
His Gin Phe Ser Met Ser Asn Thr 
Gly Val Glu Leu His Asp Pro Ser 
20 Arg His Ser Phe Leu Ala Asp Pro 
Met Asn Glu Thr Gly Ala Tyr Glu 
Tyr His Pro His Gin Gin Gin Tyr 
Arg Leu Ser Gly Pro Val Pro Asp 
Ser Gly Ser He Glu Gin His Asp 
25 Met Trp Asn Asn Arg Glu Tyr Tyr 
Gly Gly Tyr Thr Gin Asp Gly Asn 
Asp Glu Tyr Thr Ser Asn Ala Ser 
Gly Gly Gly Ser Gly Gly Gly Gly 
Asp Ser Ser Asp Ser Glu Gly Ala 
30 Asp Ala Leu Ala Gin He Asn Phe 
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Asp Ala Glu Arg Phe Asn Gly 1290 

Val Arg Tyr Val Arg Thr Gly 1305 

Thr Arg Pro He Gly Pro Asn 1320 

Leu Phe Val Leu Gly Ser He 1335 

Leu Asn His Phe Ser Met Asp 1350 

His Arg Asn He Val Pro Gly 1365 

Gly Leu val Val Val Val Val 1380 

Ala Ser Met Val Pro Val He 1395 

Gin Leu Val He Asp He Val 1410 

His Arg Ser Arg Leu Gly Glu 1425 

Gly Trp Val Thr Arg Lys Met 1440 

Arg Asp Pro Asn Gly Gin Asp 1455 

Gly Pro Arg Ala Ser Met Thr 1470 

Gly Pro Ala Ser He Lys Ala 1485 

Met Gly Met Thr Ala Thr Met 1500 

Gin Gin Gin Gin Tyr Gin Gin 1515 

Gly Met His Pro Gin Gin Gin 1530 

Pro Pro Gin Gly Pro Pro Gin 154 5 

Asp Arg Thr Pro Thr Asp Asn 156 0 

Arg Met Gin Asn Gin Gly Gin 1575 

Pro Met Asn Tyr Gin Asn Ala 1590 

Glu Ser Glu Asp Gly Gly Ser 1605 

Val Leu Arg Pro Gly Pro Ser 1620 

Gin Ala Asn Asn Asp Asn Asn 1635 

Gly Asn Ser Pro Ser Tyr Ala 1650 

He His Glu Gin Gin Gin His 1665 

Tyr Gly Gly Asn Gin Gly Ala 1680 

Gly Leu Arg Val Ala Asn Arg 1695 

Asp Asp Asp Ala Trp Arg Arg 1710 

Ala Gly Gly Ala Ala Ala Ala 172 5 
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Ser Ala Gly Ala Pro Ala Ala Gly Ala Ser Ser Ser Gin Pro Gly 1740 

1 743 

His Ala Gin 

In another embodiment of the invention, there is provided an 
isolated nucleic acid molecule encoding a CPSl peptide synthetase homolog and 
5 which hybridizes to a nucleic acid molecule having a sequence corresponding to 
SEQIDNo. 41 as follows: 

AAGAAGAAAG GGCCGACCGA GTTGACCGAA ATATTGCTAG ATAAGGAAGC ACTGAAGCTG 60 

AACGAAGTTG TTGTTTTGGC CATTGGAGAG GAAGTGAGCA AGCGTGTCAA CGAACCCGGC 120 

ACTATGAGAG TCGGTGCTTT TGGCTACCCG ATACCAGATG CGACGCTGGC CGTCGTCGAT 180 

CCGGAAACTA ATCTTTTGTG TTCACCCTAT TCCATAGGAG AGATCTGGGT AGACTCGCCA 240 

15 TCATTGTCCG GAGGGTTTTG GCAGCTGCAG AAGCACACTG AGACTATTTT CCACGCTCGG 300 

CCATATCGTT TCGTAGAGGG CAGCCCAACC CCGCAACTAC TCGAACTGGA GTTTCTACGC 360 

ACTGGACTGC TCGGATGCGT GGTAGAAGGC AAAATCTTCG TATTAGGCCT GTACGAGGAC 420 

20 

CGGATTAGGC AGCGCGTTGA ATGGGTAGAG CACGGTCAGC TAGAAGCCGA ACATAGGTAT 4 80 

TTCTTCGTGC AGCATCTTGT CACCAGCATT ATGAAAGCTG TTCCAAAGAT TTACGACTGG 540 

25 TAAGTGCTAT CGAATCTCTG GGTAATCAAC CTAACATTGC GCAGCTCGTC TTTCGATTCC 600 

TATGTCAACG GCGAATACTT ACCAATCATC CTTATCGAGA CACAGGCCGC ATCAACTGCT 660 

CCCACAAATC CAGGCGGGCC ACCACAACAA CTTGACATTC CTTTCCTAGA CTCTCTTTCT 720 

30 

GAGCGATGTA TGGAGGTACT GTATCAAGAA CACCACCTTC GGGTGTATTG TGTGATGATC 780 

ACTGCACCGA ACACACTCCC GCGAGTCATC AAGAACGGTC GACGAGAAAT TGGAAACATG 84 0 

35 CTTTGCCGGA GAGAATTTGA CAATGGCTCG CTACCCTGCG TTCACGTCAA GTTTGGCGTC 900 

GAGAGGTCGG TCCAGAATAT TGCGCTAGGT GATGACCCTG CTGGCGGCAT GTGGTCTTAC 960 

GAGGCGTCGA TGGCACGCCA GCAGTTCCTG ATGCTTCAAG ATAAGCAGTA CTCTGGAGTA 1020 

40 

GATCACAGAG AAGTCGTTAT TGACGACAGA ACGTCGACGC CGCTCAACCA GTTCTCCAAC 1080 

ATTCATGACC TTATGCAATG GCGCGTACAA CGACAAGCTG AAGAGCTCGC CTACTGCACG 1140 

45 GTAGATGGTC GAGGTAAAGA GGGCAAAGGC GTCAACTGGA AGAAGTTCGA CCAGAAGGTC 1200 

GCAGGTGTCG CCATGTACCT GAAGAACAAG GTCAAGGGTC AGACTGGTGA CCACCTGCTC 1260 

TTGATGTACA CCCACTCGGA AGACTTTGTC TATGCCGTAC ACGCGTGTTT CGTCCTTGGA 1320 

50 

GCTGTGTGTA TACCCATGGC ACCAATCGAC CAGAACAGGC TAAATGAAGA CGCGCCCGCA 13 80 

CTACTACATA TCATTGCTGA CTTCAAGGTC AAGGCTATCC TCGTCAATGC TGGCGTAGAC 14 4 0 

55 CACCTGATGA AGGTCAAGCA AGTATCGCAG CACATCAAAC AGTCAGCAGT CATTCTCAAG 1500 

ATCAACGTAC CGAATACCTA TAACACCACA AAACCACCTA AGCAGTCTAG TGGTTGCCGC 1560 
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10 



15 



20 



25 



45 



GATCTTAAGC 


TCACAATACG 


ACCTGCTTGG 


ATACAATCTG 


GTTTCCCTGT 


TCTAGTATGG 


1620 


ACATACTGGA CACCTGACCA GAGACGCATA 


GCTGTGCAAT 


TAGGTCATAG 


CCAAATCATG 


1680 


GCGCTATGCA AAGTTCAGAA AGAAACX3TGC 


CAGATGACGA 


GCACACGGCC 


CGTCCTTGGA 


1740 


TGTGTTCGTA GCACGATCGG 


TCTTGGCTTC 


ATACACACCT 


GTGTTATGGG 


TATCTTCCTC 


IBOO 


GCAGCGCCAA 


CTTACCTTGT 


GTCACCTGTC 


GATTTTGCGC 


AAAACCCGAA 


CATCCTCTTC 


1860 


CAGACCATGT 


CGAGATACAA GATCAAGGAC 


GCGTATGCGA 


CCAGCCAAAT 


GCTGGACCAC 


1920 


GCTATTGCAC 


GAGGTGCTGG 


CAAGAACATG 


GCTCTGCACG 


AGCTCAAGAA 


CCTCATGATC 


1980 


GCGACTGACG 


GTCGGCCGCG 


CGTAGACGTC 


TGTAAGTGTT 


GCGATCCTGT 


ATAAGCATCT 


2040 


GAAATCTAAT 


TCTTGATAGA 


CCAGCGTGTG 


CGAGTACACT 


TCTCGCCAGC 


AAGTTTGGAC 


2100 


CGAACGGCAA 


TCAATACTGT 


TTACTCACAC 


GTACTGAATC 


CTATGGTCGC 


ATCGCGGTCA 


2160 


TACATGTGCA 


TCGAACCCAT 


AGAACTACAT 


CTCGATGTCG 


GTGCCCTTCG 


AAGAGGTCTC 


2220 


ATCATGCCTG 


TCGACCCAGA 


CACGGAACCT 


GGTGCTCTCT 


TAGTCCAGGA 


CTCGGGTATG 


2280 


GTACCAGTTA 


GTACACAAAT 


TTCAATCGTG 


AATCCAGAGA 


CAAACCAGCT 


TTGCCTAGTC 


2340 


GGCGAGTATG 


GCGAAATCTG 


GGTCCAACC 








2370 



Preferably, the CPSl nucleic acid molecule which hybridizes to a 
30 nucleic acid molecule having a sequence as set forth in SEQ ID No. 41 is from the 

plant parogen Altenaria solani. In another preferred embodiment, the CPSl gene 

is from a plant pathogen such as Alternaia alternatherae, A, alternata, A. 

amaranths A, araliae, A, brassicae, A. brassicicola, A, camelliae, A. cassiae, A. 

cheiranthU A. cinerahae, A. gossypii, A. helianthi, A. helianthinficiens, A. mali, 
35 or A. raphani 

The peptide synthetase product of SEQ ID N0:41 has a deduced 
amino acid sequence as follows (SEQ ID NO:42): 

KKKGPTELTE ILLDKEALKL NEVWLAIGE EVSKRVNEPG TMRVGAFGYP IPDATLAWD 6 0 

40 PETNLLCSPy SIGEIWVDSP SLSGGFWQLQ KHTETIFHAR PYRFVEGSPT PQLLELEFLR 120 
TGLLGCWEG KIFVLGLYED RIRQRVEWVE HGQLEAEHRY FFVQHLVTSI MKAVPKIYDC 180 
SSFDSYVNGE YLPIILIETQ AASTAPTNPG GPPQQLDIPF LDSLSERCME VLYQEHHLRV 24 0 
YCVMITAPNT LPRVIKNGRR EIGNMLCRRE FDNGSLPCVH VKFGVERSVQ NIALGDDPAG 300 
GMWSYEASMA RQQFLMLQDK QYSGVDHREV VIDDRTSTPL NQFSNIHDLM QWRVQRQAEE 360 
50 LAYCTVDGRG KEGKGVNWKK FDQKVAGVAM YLKNKVKGQT GDHLLLMYTH SEDFVYAVHA 420 
CFVLGAVCIP MAPIDQNRLN EDAPALLHII ADFKVKAILV NAGVDHLMKV KQVSQHIKQS 480 
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AVILKINVPN TYNTTKPPKQ SSGCRDLKLT IRPAWIQSGF PVLVWTYWTP DQRRIAVQLG 540 
HSQIMALCKV QKETCQMTST RPVLGCVRST IGLGFIHTCV MGIFLAAPTY LVSPVDFAQN 600 
5 PNILFQTMSR YKIKDAYATS QMLDHAIARG AGKNMALHEL KNLMIATDGR PRVDVYQRVR 660 
VHFSPASLDR TAINTVYSHV LNPMVASRSY MCIEPIELHL DVGALRRGLI MPVDPDTEPG 720 
ALLVQDSGMV PVSTQISIVN PETNQLCLVG EYGEIWVQ . 760 

10 

In another embodiment of the invention, there is provided an 
isolated nucleic acid molecule encoding a CPS 1 peptide synthetase homolog and 
which hybridizes to a nucleic acid molecule having a sequence corresponding to 
SEQ ID No. 43 as follows : 



15 


CTCGAGGTTA 


GTAAAAGATC 


CCCGTTTGTT 


CGACAAATCT 


CCATCTCCCT 


CTCAATGCCT 


60 




TTCTTGGCGC 


CTCAACCCGC 


TATTTTGAAG 


ACAGTTTGTT 


GTTGTCGCAT 


GCGACCAAAA 


120 


20 


ATCATCCTCT 
AGA7VAGGGTA 


CAAGTTTTCA 
AGCTGCTTTG 


TCGCTGACCT 
CGTCCAGAGT 


GTTTCTTGGC 
ACTTACAATT 


GTAGGAAGGA 
GCTTCTCAAT 


GATATCACAC 
TACTTACGCG 


180 
240 




CCGGCAGCTA 


CCAAAAGCGA 


CGAACTCAAC 


TTTTCTCCCA ATTCCTCGGT 


GCACCTCCAC 


300 


25 


CTCAGATTGC 


TGCTCTCGCC 


GAGCCTCAGT 


CTGGCCTACG 


CATACACTCG 


CCCGATGACT 


360 




CCGACCACCC 


TTCAGGCGAT 


GGCCATCGCG 


CTACCGCCTA 


TGCCGCTCTC 


GGTAGCAGCA 


420 


30 


GCGGTCCAAT 
CTTCAGAATC 


CCCAGATTCA 
ACCAAGACCT 


CCAGACTCAC 
TCTCCAGCAC 


CTATGTACCG 
AACCTCCACC 


ACCGCACTCT 
TTCCCTGCTG 


GGTTATGCTC 
CGCCCGGGGG 


480 
540 




GrTCTCTCGC 


TGGAGGATCG 


ACCACTGCTC 


ACCGCGACTC 


CCTCTTCTTC 


TCCCCCTCCC 


600 


35 


ATCTCGAACC 


TGAAACCCGG 


ACAGGTACTA 


TGATGTCGGG 


CGACTATGCA 


TTCAGACCCG 


660 




AGCAGCAAGG 


CACATATGGC 


GAATCCCAGC 


ATCAACAGCA 


CCAGTTCCAG 


CAACAGCAAC 


720 


40 


AGCCACAGCA 
TCGATTCGCA 


GCAACAGCAG 
AGGATACTTT 


TACGATGGGC 
TCGGATTTTG 


AGCAGTATGA 
CGGGACAGCA 


TGGACGT^ACT 
GCACTATGAT 


ACAACGCTTC 
CAGACTCAAA 


780 
840 




CCGTTGAGTA 


TGTGGGACCT 


CAGCAGCGGT 


ATTCTTCCAG 


CGATGCATTC 


TCTCCAACCG 


900 


45 


CCGCAATGGC 


ACCTCCAATG 


CTTACAACCA ACGACCTCCC ACCGCCGGAA 


GCGCTTGAGT 


960 




ACCAGCTGCC 


CCTTGACCCT 


CGCGAGGTAC 


CATTCGCTAT 


TCAAGATCCC 


CATGATGATT 


1020 


50 


CTACGCCAAT GTCAAAGTTC 
TTGCTAAGAA GCCGGCATAC 


GATAACATCG 
TGGGTGTTGG 


CAGCTGTACT 
ATAGTAAGGG 


CAGACATAGA 
CAAGGAGATT 


GGCCGAACGA 
GCATCGATTA 


1080 
1140 




CGTGGGATAA 


GCTGGCATCT 


AGAGCCGAAA 


AGGTTGCGCA 


AGTCATCCGC 


GACAAAAGCT 


1200 


55 


CTCTGrACCG 


GGGTGATCGG 


GTTGCTCTCA 


TCTACCGCGA 


TTCAGAGGTT 


ATTGATTTCG 


1260 




CCATTGCCTT 


GCTGGGATGC 


TTCATTGCTG 


GAGTTGTTGC 


CGTTCCCATC 


AATGATCTGC 


1320 




AGGACTACCA ACGCTTGAAC 


CACATTCTTA 


CTACAACGCA 


GGCCCATCTA 


GCGCTGACCA 


1380 



60 
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CCGATAACAA CCTCAAAGCC TTTCAACGAG 
AGGGTGTCGA ATGGTGGAAG ACAAACXSAGT 
5 ATGTCCCGGC TTTGGTTGTT CCCGATCTGG 
GAGACTTGAG AGGTGTTGTT CTGAGCCACC 
GTCCGATTAT TTCTACTATC CCGGGTAATG 

10 

GCGACAAGAA TGGTCGACTT ATTGGTGGCG 
TCGATCCCCG TCAGGGCATT GGCATGATTC 
15 ACACCACTGT TTGGTTCGAC AACAAAGCTG 
TTACCAAGTA CAAATCGACC ATCATGATTG 
ACAACTACCA GCAAGAGCCA ATGGTGACCC 

20 

- TTCAAATGAT CAAGCTTTGC TTGATTGACA 
TTTTGGCTGA CCGATGGCTA CGACCGTTGA 
25 CTATGCTTTG TCTACCTGAA CACGGAGGCA 
GAGAAGAGCG CATGGGATGC CCATTAAAGC 
AAGAGAAAGA GGAAACAGAG AAGCCAGCAG 

30 

GTGGTGGCAC AGCAACAACC GAAGAGAGGG 
ATCGTGAGGC TCTAAAGACC AACGAAGTTG 
35 AAAGGGTGAC GGATGACCCA GGCTTGGTAC 
ATGCCACACT CTCCGTCGTC GATCCAGAAA 
GTGAAATCTG GGTCGACTCC CCTTCTCTTT 

40 

CTGAGCTGAT TTTCCATGCT CGTCCTTACA 
CCGTCGAGCC CGAATTCCTG CGAACAGGCT 
45 TTGTTCTGGG CCTTTACGAA GACCGAATTC 
ACGAACTAGC AGAGTACCGC TACtTCTTTG 
ACGTTCCAAA GATATACGAT TGTTCAGCCT 

50 

CAGTCGTGGT GCTGGAGTCA GCAGCTGCGT 
CTCGACAACC GGATACAGCT CTGCTAGAGT 
55 TGTCAGAGCA TCATCTGAGA CTGTACTGCG 
GAGTTGTTAA GAACGGACGA CGCGAAATTG 
TCGGCAACCT TCCATGTGTG CACGTCAAGT 

60 

CTATTGGTGT AGACCCTATA GGTGGTATCT 
AATTCTTATT GCCAGCTGAC AAGCAATACT 
65 ATGACCGTAC TTCAACGCCC CTAAACAATT 



ACATTACTAC ACAAAAGTTG ACATGGCCAA 1440 

TTGGCAGTTA TCACCCCAAG AAGAAGGAGG 1500 

CATATATCGA GTTTTCGCGG GCCCCAACTG 1560 

GAACCATTAT GCACCAAATG GCTTGTCTTA 1620 

GACCTGGCGA CACTTTCAAC CCGTCTCTTC 1680 

GCGCAAGCAG CGAAATTTTG GTGTCGTACC 1*740 

TGAGCGTGCT ACTGACCX3TC TACGGCGGCC 18O0 

TTGATGTTCC TGGACTGTAC GCCCACCTCC 1860 

CCGACTACCC AGGATTGAAG CGAGCCGCCT 1920 

GAAATTTTAA GAAGGGAATG GAGCCAAACT 1980 

CCTTGACTGT AGACAGCGGG TCCCACGAAG 2 04 0 

GAAACCCTCG TGCCCGTGAG GTTGTCGCAC 2100 

TGGTGATTAG TGTGCGTGAC TGGCTAGGAG 2160 

TTGAACTTGG GGAGGATACA GAGTCTGACG 2220 

TTTCCAATGG CTTTGGTAGT CTCTTGTCAG 2280 

CAAAGAATGA GCTTGGCGAA GTCCTTTTGG 2340 

TGGTGGTGGC CATAGGTAAC GATGCCCGTA 24 00 

GGGTCGGTTC TTTTGGATAC CCCATACCCG 24 60 

CGGGTTTACT GGCGTCACCA CATTCCGTGG 2520 

CAGGTGGTTT CTGGGCGCAG CCAAAGAATA 2580 

AGTTTGACCC AGGTGATCCT ACACCGCAGC 2640 

TGCTGGGCAC CGTCATCGAG GGTAAAATCT 2700 

GACAAAAGGT TGAGTGGGTT GAGCATGGAC 27 60 

TTCAGCACAT CGTTGTGAGC ATTGTCAAGA 2820 

TTGACGTCTT TGTCAATGAC GAACACCTGC 28 80 

CAACGGCACC ATTGACATCT GGAGGACCTC 294 0 
CATTGGCTGA GCGCTGCATG GAGGTTCTCA 3000 
TTATGATCAC AGCACCCGAC ACTTTGCCTC 3060 
GTAACATGCT TTGCCGTCGG GAGTTTGATC 3120 
TTGGCGTGGA GCATGCAGTA CTTAACCTCC 3180 
GGTCACCGTT GGCGTCCGAT TCTCGTGCCG 324 0 
CTGGTGTCGA CAGGCGCGAA GTCGTTATCG 33 OO 
TCTCTTGCAT TTCGGATCTT ATCCAATGGC 33 60 
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GCGTGGCCCG TCAACCAGAA GAGCTAGCGT 
GTAAGGGTGT AACATGGAAG AAATTCGACA 
5 AGAACAAGGT CAAGGTGAGG CCGGGAGACC 
AGTTTGTCTT TGCCATCCAT GCCTGCATTT 
CCCTCGACCA GAACCGATTG AACGAAGATG 

10 

ACAACGTCAA GGCTGTGCTG GTCAACGCTG 
TGGCTAGCCA TATCAAACAG TCAGCCCAGG 
15 ACACAACTAA GCCGCCAAAG CAAAGTAGTG 
CTGCCTGGAT TCGGCCTGGC TACCCCGTCA 
GACGAATTTC AGTTCAGCTT GGACATGACA 

20 

AAACTTGGCA AATGACAAGT TCAAGACCTG 
TAGGCTTTAT TCATACGGCT CTGATGGGAA 
25 CACCTGTCGA GTTTGCAGCC AACCCCATGT 
TTAAGGATAC TTATGCGACA CCACAGATGC 
AGGGCTTTAC ACTTCATGAA CTTAAGAACA 

30 

TTGATGTTTT CCAAAAGGTC AGACTTCACT 
TTAACACGGT CTATTCGCAT GTCCTCAACC 
35 TCGAGCCTAT TGAGCTTTGG TTGGACACGC 
TGGACCCTGA ATCAGATCCT CTGGCCCTAC 
CAACCCAAAT AGCCATCATC AACCCTGAAA 

40 

GTGAAATTTG GGTCGACTCT GAAGCCTGCG 
TTGACGCTGA GCGCTTTGAT GGCCGAGCTC 
45 GTACCGGAGA CTTGGGTTTC CTTCATAATG 
AGGTGGACAT GCAAGTGTTG TTTGTTCTCG 
GATTGAGCCA TTTCCCAATG GATATTGAGA 

50 

TGGCGAATGG CTGGTAAGTA TAAAATCTCT 
GCGGTGTTCC AAGCTGGTGG CTTGGTGGTT 
55 CTGGCATCX3A TTGTTCCCGT CATTGTCAAC 
GATATCGTCG CATTCGTCAA CAAGGGAGAC 
CGTGGCAAGA TTCTTGGTGG CTGGGTTAGT 

60 

ATTCGCGATA TGGACGCCGA ATCCACAGCT 
ATGGTCAGCG TACGAAGCGG AGGCGGTGCT 
65 GAACCTGCGC CTCAAATCTT GGAGGAGGAA 
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ACTGCACAAT CGATGGCAAA AGCCGAGAAG 34 20 

CCAAGGTCGC TTCCGTTGCC ATGTACCTGA 34 80 

ACATCATCCT CATGTACACA CATTCAGAGG 354 0 

CCTTGGGCGC AATTGTCATT CCCATCGCAC 3600 

TCCCAGCTTT CCTGCATATT GTATCTGATT 3660 

AGGTCGATCA TCTAATCAAG GTAAAGCCTG 3720 

TTCTCAAGAT CACGAGCCCT GCCATCTACA 3760 

GATTGAGGGA TTTGAGATTC ACCATTGACC 3 84 0 

TTGTTTGGAC TTATTGGACC CCCGATCAAC 3900 

CCATTATGGG CATGTGCAAG GTTCAAAAGG 3 960 

TGCTTGGATG TGTACGAAGC ACGACTGGCC 4 020 

TTTATATCGG AACACCAACC TACCTCCTAT 4 080 

CTCTATTCGT CACCTTGTCG AGATACAAGA 4140 

TTGATCATGC CATGAACTCC ATGCAGGCCA 4 200 

TGATGATCAC TGCCGAGAGC CGACCAAGAG 4260 

TTGCTGGGGC TGGGCTCGAT AGAACTGCTA 4320 

CCATGGTAGC GTCGCGATCT TATATGTGCA 4 380 

AAGCGCTTCG ACGTGGTCTG GTTATTCCTG 444 0 

TGGTACAGGA CAGCGGTATG GTTCCAGTTT 4 500 

GCAGAATACA CTGCCTGGAT GGTGAGTATG 4560 

TCAAGTCATT CTATGGCTCC AAAGACGCTT 4620 

TTGACGGCGA TCCCAACATT CAGTATATCC 4680 

TTAGTCGACC TATTGGCCCT AATGGTGCCC 4 740 

GCAACATTGG CGAGACTTTT GAGATCAACG 4 800 

ACTCGGTGGA AAAATGCCAC AGAAACATTG 4860 

ATTTGAAGCG AATATGCTAA CAAAGTCAGT 4 920 

GTTCTGGTTG T^GTCAACCG CAAGCCATAC 4980 

GCTATCCTCA ATGAACACCA AATCATTGTA 5040 

TTCCCACGGT CTCGTCTAGG AGAGAAGCAG 5100 

AGAAAGCTGA GGACTCTTGC CCAGTTCTCG 5160 

GGTGATATGA TGGATCCTTC TAGAGCATCA 5220 

GCTCCCGGAT CTTCTAGTTT GAGGAATGTC 5280 

CATGACCAGA TGACTCCTCG TCACGAATAC 534 0 
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10 



15 



20 



GAAGCAGCCC 


CTACCATGAT 


TTCTGAACTT 


CCCGACGGCC 


AAGAGACACC 


GACAGGGTTT 


5400 


CAGCACTCGC 


AATACGAACA 


CCCACCACAA 


TCAGCCGGTT 


CTCAAGCACC 


AGCCCAGCTG 


5460 


AACCTTTCTC 


ACCAGCCCGA 


TCAAGGATTC 


GATATGGACT 


TTTCACGATA 


TAGTTCAGCA 


5520 


GAGCCCGATC 


ACGGCCCTGT 


CCACAGACGT 


CCAGTCCCAG 


GCCAAGCCCA 


ACAACCCGAG 


5580 


CCTATGCAAG 


GGTACGGTCA 


AGCGCCGCCC 


CAGATCCGGC 


TACCAGGTGT 


TGATGGACGA 


5640 


GAGGAGGGAG 


GGTTCTGGTC 


ACAGCAGGAA 


AAGAACGAGA 


AGAGTGAAGA 


AGACTGGACA 


5700 


ACTGATGCCA 


TGATGCATAT 


GAATCTGGCA 


GGTGATATGA 


AACCGCCACG 


ATGATAATAC 


5760 


ACAACATAAG 


AGCGAAGTGA 


CGAAGCGGAG 


TCGGAGTTGG 


GAAGCATTTA 


GAAACGAATA 


5820 


ACAAACAATT 


GGACTTGTCG 


GTCTGATGGC 


CTATTTACTT 


CATTCATAGA 


TGAGGATTGG 


5880 


ATAGTGAATA 


TGTGATTGGA 


TAAAGCCTGG 


GTTTGTGAGT 


TTGTGAATGC 


AGTGGGTGCT 


5940 


TGCTATAAGC 


TGTTTTATTG 


AGGTCTTTGG 


AGGAGTGTCT 


AACAAAGATG 


CAAAGTTACT 


6000 


AGT 












6003 



25 Preferably, the CPSl nucleic acid molecule which hybridizes to a 

nucleic acid molecule having a sequence as set forth in SEQ ID No. 43 is from the 
plant pathogen Fusarium graminearium. In another preferred embodiment, the 
CPSl gene is from a plant pathogen such as Fusarium avenaceum, F, carpineum, 
F. chlamydosporum, F. coccophilum, F. culmorum, F. episphaeria, F, equiseti, F. 

30 Jlocciferum, F. moniliforme, F. oxysporum, F. redolem, F, sambucinum, F. solani, 
F. subglutinans, F, irichothecioides, F, udum, or F, veniricosum. 

The peptide synthetase product of SEQ ID NO:43 has a deduced 
amino acid sequence as follows (SEQ ID No. 44): 



35 



40 



45 



50 



MMSGDYAFRP 


EQQGTYGESQ 


HQQHQFQQQQ 


QPQQQQQYDG 


QQYDGRTTTL 


LDSQGYFSDF 


60 


AGQQHYDQTQ 


TVEYVGPQQR 


YSSSDAFSPT 


AAMAPPMLTT 


NDLPPPEALE 


YQLPLDPREV 


120 


PFAIQDPHDD 


STPMSKFDNI 


AAVLRHRGRT 


lAKKPAYWVL 


DSKGKEIASI 


TWDKLASRAE 


180 


KVAQVIRDKS 


SLYRGDRVAL 


lYRDSEVIDF 


AIALLGCFIA 


GWAVPINDL 


QDYQRLNHIL 


240 


TTTQAHLALT 


TDNNLKAFQR 


DITTQKLTWP 


KGVEWWKTNE 


FGSYHPKKKE 


DVPALWPDL 


300 


AYIEFSRAPT 


GDLRGWLSH 


RTIMHQMACL 


SAIISTIPGN 


GPGDTFNPSL 


RDKNGRLIGG 


360 


GASSEILVSY 


LDPRQGIGMI 


LSVLLTVYGG 


HTTVWFDNKA 


VDVPGLYAHL 


LTKYKSTIMI 


420 


ADYPGLKRAA 


YNYQQEPMVT 


RNFKKGMEPN 


FQMIKLCLID 


TLTVDSGSHE 


VLADRWLRPL 


480 


RNPRAREWA 


PMLCLPEHGG 


MVISVRDWLG 


GEERMGCPLK 


LELGEDTESD 


EEKEETEKPA 


540 


VSNGFGSLLS 


GGGTATTEER 


AKNELGEVLL 


DREALKTNEV 


VWAIGNDAR 


KRVTDDPGLV 


600 


RVGSFGYPIP 


DATLSWDPE 


TGLLASPHSV 


GEIWVDSPSL 


SGGFWAQPKN 


TELIFHARPY 


660 
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IS 



25 



35 



45 



55 



KFDPGDPTPQ PVEPEFLRTG LLGTVIEGKI FVLGLYEDRI RQKVEWVEHG HELAEYRYFF 72 0 

VQHIWSIVK NVPKIYDCSA FDVFVNDEHL PVWLESAAA STAPLTSGGP PRQPDTALLE 780 

^ SLAERCMEVL MSEHHLRLYC VMITAPDTLP RWKNGRREI GNMLCRREFD LGNLPCVHVK 84 0 

FGVEHAVLNL PIGVDPIGGI WSPLASDSRA EFLLPADKQY SGVDRREWI DDRTSTPLNN 900 

10 FSCISDLIQW RVARQPEELA YCTIDGKSRE GKGVTWKKFD TKVASVAMYL KNKVKVRPGD 960 

HIILMYTHSE EFVFAIHACI SLGAIVIPIA PLDQNRLNED VPAFLHIVSD YNVKAVLVNA 1020 

EVDHLIKVKP VASHIKQSAQ VLKITSPAIY NTTKPPKQSS GLRDLRFTID PAWIRPGYPV 1080 

IVWTYWTPDQ RRISVQLGHD TIMGMCKVQK ETCQMTSSRP VLGCVRSTTG LGFIHTALMG 114 0 

lYIGTPTYLL SPVEFAANPM SLFVTLSRYK IKDTYATPQM LDHAMNSMQA KGFTLHELKN 1200 

20 MMITAESRPR VDVFQKVRLH FAGAGLDRTA INTVYSHVLN PMVASRSYMC lEPIELWLDT 1260 

QALRRGLVIP VDPESDPLAL LVQDSGMVPV STQIAIINPE SRIHCLDGEY GEIWVDSEAC 1320 

VKSFYGSKDA FDAERFDGRA LDGDPNIQYI RTGDLGFLHN VSRPIGPNGA QVDMQVLFVL 1380 

GNIGETFEIN GLSHFPMDIE NSVEKCHRNI VANGCAVFQA GGLVWLVEV NRKPYLASIV 1440 

PVIVNAILNE HQIIVDIVAF VNKGDFPRSR LGEKQRGKIL GGWVSRKLRT LAQFSIRDMD 1500 

30 AESTAGDMMD PSRASMVSVR SGGGAAPGSS SLRNVEPAPQ ILEEEHDQMT PRHEYEAAPT 1560 

MISELPDGQE TPTGFQHSQY EHPPQSAGSQ APAQLNLSHQ PDQGFDMDFS RYSSAEPDHG 1620 

PVHRRPVPGQ AQQPEPMQGY GQAPPQIRLP GVDGREEGGF WSQQEKNEKS EEDWTTDAMM 1680 
HMNLAGDMKP PR 



1692 



In another embodiment of the invention, there is provided an 
isolated nucleic acid molecule encoding a CPSl peptide synthetase homoiog and 
40 which hybridizes to a nucleic acid molecule having a sequence corresponding to 
SEQIDNo, 45 as follows : 



60 



AAAAAGAAGG GGCCTACGGA GTTGACCGAG ATATTGCTAG ATAAGGAAGC GCTCAAGATG 

AACGATGTTG TGGTCCTTGC AATAGGAGAA GAGGCCAGTA AACGTGCGAA TGAGCCTGGC 120 

ACAATGCGAG TTGGCX5CTTT TGGATACCCA ATACCAGATG CGACGCTAGC CGTCGTAGAT 180 

CCAGAGACGA ATCTCTTGTG TTCACCCTAC TCGATAGGAG AGATTTGGGT AGACTCACCT 24 0 

50 TCATTGTCTG GTGGTTTCTG GCAATTGCAG AAGCACACTG AAACTATATT TCACGCCCGC 300 

CCATACCGCT TTGTGGAGGG CAGTCCTACC CCGCAGTTGC TTGAGCTTGA GTTTCTCCGG 360 

ACAGGCTTAC TCGGATTCGT CGTAGAGGGC AAGGTCTTTA TCCTTGGTCT CTATGAAGAT 420 

CGCATCAGGC AGCGCGTTGA ATGGGTAGAA CATGGTCAGC TGGAAGCTGA ACACAGATAC 4 80 

TTCTTCGTGC AGCACCTCX3T CACCAGTATC ATGAAGGCTG TTCCCAAGAT CTACGACTGG 540 

60 TAAGTCTTCT CATGTTTTAG ATGAGCGTTC TAACACTATG CAGCTCATCT TTCGACTCGT 600 
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ACGTCAATGG CGAATACCTG CCTATCATCC TCATCGAGAC ACAGGCTGCA TCGACAGCCC S60 

CTACGAACCC TGGTGGACCG CCACAGCAAC TCGACATCCC CTTCCTAGAC TCACTGTCTG 720 

^ AGCGATGCAT GGAAGTGTTG TATCAAGAAC ACCATCTGCG AGTATACTGC GTCATGATCA 780 

CAGCGCCAAA CACATTACCA CGAGTTGTTA AGAATGGTCG ACGAGAAATT GGCAACATGC 84 0 

10 TCTGTCGAAG AGAAITTCAT AATGGCTCAT TACCTTGTGT CCACGTCAAG TTTGGTGTTG 900 
AGAGGTCAGT TCTCAACATC GCGTTGGGTG ATGACCCCTC CGGAGGCATG TGGTCATATG 960 

AAGCCTCX3AT GGCGCX3TCAG CAGTTCTTGA TGCTCCAAGA CAAGCAGTAT TCTGGAGTAG 1020 

ATCACCGCGA AGTCGTCATG GATGACAGAA CATCGACACC TCTCAACCAA TTCTCCAACA lOBO 

TTCACGACCT CATGCAATGG CGCGTATCAC GGCAGGCTGA AGAGCTCGCA TATTGCACAG 114 0 

20 TCGACGGTCG AGGCAAAGAA GGCAAGGGCG TCAACTGGAA GAAGTTCGAC CAGAAAGTTG 1200 

CGGGTGTCGC AATGTACCTG AAGAACAAGG TCAAAGTGCA AACCGGCGAT CATCTGCTTC 126 0 

TGATGTATAC GCACTCGGAA GACTTTGTAT ATGCGGTACA TGCATGCTTT GTGCTTGGCG 1320 

25 

CTGTATGCAT ACCAATGGCA CCAATCGACC AGAACCGATT GAATGAGGAT GCACCTGCAT 1380 

TGCTGCACAT CCTTGCAGAC TTCAAGGTCA AGGCCATCCT CGTCAATGCC GATGTGGATC 1440 

30 ATCTCATGAA GGTCAAGCAA GTATCGCAGC ACATCAAACA ATCAGCAGCC ATCTTCAAGA 15O0 

TCAACGTGCC GCACACTTAC AACACAACCA AGCCACCTAA GCAGTCGAGT GGTTGTCGGG 1560 

ATCTCAAGCT CACAATACGG CCTGCCTGGG TACAGCCTGG TTTCCCAGTT CTTGTATGGA 1620 

35 

CATACTGGAC TCCAGATCAA CGCCGTATAG CCGTACAACT AGGTCATAGC CAAATCATGG 1680 

CACTAGGCAA GGTCCAGAAG GAGACTTGTC AAATGACAAG TACAAGGCCA GTCCTAGGAT 1740 

40 GTGTACGGAG TACCATCGGA CTTGGCTTCA TTCATACCTG CATCATGGGC ATCTTCCTTG 1800 

CCGCACCCAC TTACCTCGTG TCGCCTGTCG ACTTTGCACA AAATCCAAAC ATACTCTTCC 1860 

AGACGTTATC AAGATACAAG ATCAAGAATG CGTACGCAAC CAGTCAAATG TTGGATCACG 1920 

45 

CTATTGCCCG TGGGGCTGGA AAGAACATGG CCCTGCACGA ACTCAAGAAT CTCATGATTG 1980 

CGACTGATGG TAGGCCGCGT GTTGATGTTT ACCAGAGAGT GCGCGTACAC TTTTCACCAG 2040 

50 CAAGCTTGGA CCGGACAGCG ATTAACACAG TCTACTCTCA CGTGCTCAAC CC7UVTGGTAG 2100 

CATCGCGATC ATACATGTGC ATCGAGCCAA TAGAACTGCA TCTCGACGTC AACGCTCTTC 2160 

GAAGAGGTCT GATCATGCCC GTCGACCCAG ATACCGAGCC TGGCGCTCTA ATGGTCCAGG 2220 

ACTCTGGTAT GGTGCCAGTC TCCACACAAA TAGCAATTGT GAACCCAGAG ACAAACCAGC 22 80 



TTTGCTTGGT TGGCGAATAT GGCX3AAATCT GGGTTCAATC 



2320 



60 Preferably, the CPSl nucleic acid molecule which hybridizes to a 

nucleic acid molecule having a sequence as set forth in SEQ ID No. 45 is from the 
plant pathogen Pyrenophora teres. In another preferred embodiment, the CPS 1 
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gene is from a plant pathogen such as Pyrenophora avenae, P. bromi, P. 
leuceienes, P. phaeocomes, P. schroeteri, P. irichostoma, or P, iritici-repentis. 



The peptide synthetase product of SEQ ID No:45 has a deduced 





amino acid sequence as follows (SEQ ID No. 46): 






5 


KKKGPTELTE ILLDKEALKM NDVWLAIGE 


EASKRANEPG 


TMRVGAFGYP 


IPDATLAWD 


60 




PETNLLCSPY SIGEIWVDSP SLSGGFWQLQ 


KHTETIFHAR 


PYRFVEGSPT 


PQLLELEFLR 


120 


10 


TGLLGFWEG KVFILGLYED RIRQRVEWVE 


HGQLEAEHRY 


FFVQHLVTSI 


MKAVPKIYDC 


180 


SSFDSYVNGE YLPIILIETQ AASTAPTNPG 


GPPQQLDIPF 


LDSLSERCME 


VLYQEHHLRV 


240 




YCVMITAPNT LPRWKNGRR EIGNMLCRRE 


FDNGSLPCVH 


VKFGVERSVL 


NIALGDDPSG 


300 


15 


GMWSYEASMA RQQFLMLQDK QYSGVDHREV 


VMDDRTSTPL 


NQFSNIHDLM 


QWRVSRQAEE 


360 




LAYCTVDGRG KEGKGVNWKK FDQKVAGVAM 


YLKNKVKVQT 


GDHLLLMYTH 


SEDFVYAVHA 


420 


20 


CFVLGAVCIP MAPIDQNRLN EDAPALLHIL 
AAIFKINVPH TYNTTKPPKQ SSGCRDLKLT 


ADFKVKAILV 
IRPAWVQPGF 


NADVDHLMKV 
PVLVWTYWTP 


KQVSQHIKQS 
DQRRIAVQLG 


480 
540 




HSQIMALGKV QKETCQMTST RPVLGCVRST 


IGLGFIHTCI 


MGIFLAAPTY 


LVSPVDFAQN 


600 


25 


PNILFQTLSR YKIKNAYATS QMLDHAIARG 


AGKNMALHEL 


KNLMIATDGR 


PRVDVYQRVR 


660 




VHFSPASLDR TAINTVYSHV LMPMVASRSY 


MCIEPIELHL 


DVNALRRGLI 


MPVDPDTEPG 


720 




ALMVQDSGMV PVSTQIAIVN PETNQLCLVG 


EYGEIWVQ 






758 



30 



As used herein, the term "nucleic acid" refers to deoxyribonucleic 
acid (DNA) or ribonucleic acid and polymers thereof in either a single or double 
stranded form. As used herein, "nucleic acid" also encompasses nucleic acids 
containing known analogs of naturally occurring nucleotides. Unless otherwise 

35 indicated, a particular nucleic acid sequence also encompasses conservatively 
modified variants thereof such as degenerate codon substitutions and 
complementary sequences. As used herein, the term "nucleotide sequence" refers 
to a polymer of DNA or RNA which may be single or double stranded and may 
contain synthetic, non-natural, or altered nucleotide bases capable of incorporation 

40 into DNA or RNA polymers. The terms "nucleic acid" , "nucleic acid molecule", 
"nucleic acid fragment", or nucleic acid sequence or segment", may also be used 
interchangeably with the terms "gene", "cDNA", "DNA" and "RNA". 

Other DNA molecules of the present invention include DNA 
molecules that have a nucleic acid sequence which is more than 70% identical to 
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the nucleotide sequence of SEQ. ID. Nos. 2,41. 43, or 45. Nucleotide sequence 
similarity may be determined by the BLAST program with the default parameters 
(Altschul et al., "Basic Local Alignment Search Tool," J. Mol. Biol., 215:403-410 
(1990), which is hereby incorporated by reference). 

5 Preferred sequences include those DNA molecules which will 

hybridize to a nucleic acid molecule having the sequence of SEQ. ID No. 2, 41 , 
43, 45 or their compliments. Generally, stringent conditions are selected to be 
about 50°C lower than the thermal melting point (T„) for the specific sequence at 
a defined ionic strength and pH. The T„ is the temperature (under defined ionic 

10 strength and pH) at which 50% of the target sequence hybridizes to a perfectly 
matched probe. The T™ is dependem upon the solution conditions and the base 
composition of the probe, and may be calculated using the following equation: 

Tm = 79.8°C + ( 1 8.5 X Log[Na+]) 
+ (58.4°C X %[G+C]) 
J 5 - (820 / #bp in duplex) 

(0.5 X % formamide) 
More preferred stringent conditions are when the temperature is 
20''C below Tm, and the most preferred stringent conditions are when the 
temperature is 10°C below T^. Nonspecific binding may also be controlled using 
20 any one of a number of known techniques such as, for example, blocking the 
membrane with protein-containing solutions, addition of heterologous RNA, 
DNA, and SDS to the hybridization buffer, and treatment with RNase. 

Wash conditions are typically performed at or below stringency. 
Generally, suitable stringent conditions for nucleic acid hybridization assays or 
25 gene amplification detection procedures are asas set forth above. More or less 
stringent conditions may also be selected. 

For the purposes of defining the level of stringency, reference can 
conveniently be made to Sambrook, J., E.F. Fritsch, et al. 1989 "Molecular 
Cloning: a Laboratory Manual, 2""* Edition, Cold Spring Harbor, NY, Cold Spring 
30 Harbor Laboratory Press, at 1 1 .45. An example of low stringency conditions is 
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4-6X SSC/0. 1 -0.5% w/v SDS at 37M5° C for 2-3 hours. Depending on the 
source and concentration of the nucleic acid involved in the hybridization, 
alternative conditions of stringency may be employed such as medium stringent 
conditions. Examples of medium stringent conditions include 1-4X SSC/0.25% 

5 w/v SDS at > 45° C for 2-3 hours. An example of high stringency conditions 
includes 0.1-lX SSC/0.1% w/v SDS al 60 C for 1-3 hours. The skilled artisan is 
aware of various parameters which may be altered during hybridization and 
washing and which will either maintain or change the stringency conditions. For 
example, another stringent hibridization condition is hybridization at 4X SSC at 

1 0 65° C, followed by a washing in 0. IX SSC at 65° C for about one hour. 
Alternatively, an exemplary stringent hybridization condition is in 50% 
formamide, 4XSSC, at 42° C. Still another example of stringent conditions 
include hybridization at 62° C in 6X SSC, .05X BLOTTO, and washing at 2X 
SSC, 0.1% SDS at 62° C. 

1 5 Other proteins or polypeptides of the present invention include 

polypeptides that have an amino acid sequence having at least 75 % similarity to 
the amino acid sequence of at least one of SEQ. ID. No. 3, SEQ ID No. 42, SEQ 
ID No. 44, or SEQ ID No. 46. In a preferred embodiement of the invention, the 
protein or polypeptide will have at least 90% similarity with at least one of SEQ. 

20 ID No. 3, SEQ ID No. 42, SEQ ID No. 44, or SEQ ID No. 46. Protein sequence 
similarity may be determined by the BLAST program with the default parameters 
(Altschul et al., "Basic Local Alignment Search Tool," J. Mol. Biol.. 215:403-410 
(1990), which is hereby incorporated by reference). The CPSl protein of SEQ. 
ID. No. 3 has a molecular weight of about 190-200 kDa, preferably 193.2 kDa. 

25 The CPSl protein contains two structurally similar modules, both of which are 
similar to SafBl, the first module of saframycin synthetase B (overall 25% 
identity; 50% similarity) and have apparent amino-acid-activating and thiolation 
domains with core sequences conserved in known peptide synthetases. 

The DNA molecule encoding a CPSl protein or polypeptide of the 
30 present invention can be incorporated in cells using conventional recombinant 
DNA technology. Generally, this involves inserting the DNA molecule into an 
expression system to which the DNA molecule is heterologous (i.e., not normally 
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present). The heterologous DNA molecule is inserted into the expression system 
or vector in proper sense orientation and correct reading frame. The vector 
contains the necessary elements for the transcription and translation of the inserted 
protein-codin^^sequences. U.S. Patent No. 4,237,224 to Cohen and Boyer, which 

5 is hereby incorporated by reference, describes the production of expression 
systems in the form of recombinant plasmids using restriction enzyme cleavage 
and ligation with DNA ligase. These recombinant plasmids are then introduced 
by means of transformation and replicated in unicellular cultures including 
procaryotic organisms and eukaryotic cells grown in culture. Recombinant genes 

10 may also be introduced into viruses, such as vaccinia virus. Recombinant viruses 
can be generated by transfection of plasmids into cells infected with virus. 

Suitable vectors include, but are not limited to, the following viral 
vectors such as lambda vector system gtl 1, gt WES.tB, Charon 4, and plasmid 
vectors such aspBR322, pBR325, pACYC177, pACYC184, pUC8, pUC9, 

15 pUCl 8, pUC19, pLG339, pR290, pKC37, pKClOl, SV 40, pBluescript II SK +/- 
or KS +/- (see "Stratagene Cloning Systems" Catalog (1993) from Stratagene, La 
JoUa, Calif, which is hereby incorporated by reference), pQE, pIH821, pGEX, 
pET series (see Studier et. al., "Use of T7 RNA Polymerase to Direct Expression 
of Cloned Genes," Gene Expression Technology , vol. 185 (1990), which is hereby 

20 incorporated by reference), and any derivatives thereof. Suitable vectors are 
continually being developed and identified. Recombinant molecules can be 
introduced into cells via transformation, transduction, conjugation, mobilization, 
or electroporation using methods well known in the art. The DNA sequences are 
cloned into the vector using standard cloning procedures in the art, as described by 

25 Maniatis et al.. Molecular Cloning: A Laboratory Manual . Cold Springs 

Laboratory, Cold Springs Harbor, New York (1982), which is hereby incorporated 
by reference. 

A variety of host- vector systems may be utilized to express the 
protein-encoding sequence(s). Primarily, the vector system must be compatible 
30 with the host cell used. Host-vector systems include but are not limited to the 
following: bacteria transformed with bacteriophage DNA, plasmid DNA, or 
cosmid DNA; microorganisms such as yeast containing yeast vectors; mammalian 
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cell systems infected with virus (e.g., vaccinia virus, adenovirus, etc.); insect cell 
systems infected with virus (e.g.. baculovirus); and plant cells infected by bacteria 
or transformed via particle bombardment (i.e. biolistics). The expression elements 
of these vectors vary in their strength and specificities. Depending upon the host- 
5 vector system utilized, any one of a number of suitable transcription and 
translation elements can be used. 

Different genetic signals and processing events control many levels 
of gene expression (e.g., DNA transcription and messenger RNA ("mRNA") 
translation). Transcription of DNA is dependent upon the presence of a promoter 

1 0 which is a DNA sequence that directs the binding of RNA polymerase and thereby 
promotes mRNA synthesis. The DNA sequences of eukaryotic promoters differ 
from those of procaryotic promoters. Furthermore, eukaryotic promoters and 
accompanying genetic signals may not be recognized in or may not fiinction in a 
procaryotic system, and, further, procaryotic promoters are not recognized and do 

1 5 not function in eukaryotic cells. Similarly, translation of mRNA in procaryotes 
depends upon the presence of the proper procaryotic signals which differ from 
those of eukaryotes. Efficient translation of mRNA in procaryotes requires a 
ribosome binding site called the Shine-Dalgamo ("SD") sequence on the mRNA. 
This sequence is a short nucleotide sequence of mRNA that is located before the 

20 start codon, usually AUG, which encodes the amino-terminal methionine of the 
protein. The SD sequences are complementary to the 3 '-end of the 16S rRNA 
(ribosomal RNA) and probably promote binding of mRNA to ribosomes by 
duplexing with the rRNA to allow correct positioning of the ribosome. For a 
review on maximizing gene expression, see Roberts and Lauer, Methods in 

25 Enzvmolo£V, 68:473 (1979), which is hereby incorporated by reference. 

Promoters vary in their "strength" (i.e., their ability to promote 
transcription). For the purposes of expressing a cloned gene, it is desirable to use 
strong promoters in order to obtain a high level of transcription and, hence, 
expression of the gene. Depending upon the host cell system utilized, any one of 
30 a number of suitable promoters may be used. For instance, when cloning in E. 
coli, its bacteriophages, or plasmids, promoters such as the T7 phage promoter, 
lac promoter, trp promoter, recA promoter, ribosomal RNA promoter, the Pr and 
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Pl promoters of coliphage lambda and others, including but not limited, to 
/ocUV5, ompF, bla, Ipp, and the like, may be used to direct high levels of 
transcription of adjacent DNA segments. Additionally, a hybrid trp-lacXiVS {tac) 
promoter or other E. coli promoters produced by recombinant DNA or other 

5 synthetic DNA techniques may be used to provide for transcription of the inserted 
gene. Bacterial host cell strains and expression vectors may be chosen which 
inhibit the action of the promoter unless specifically induced. In certain operons, 
the addition of specific inducers is necessary for efficient transcription of the 
inserted DNA. For example, the lac operon is induced by the addition of lactose 

10 or IPTG (isopropylthio-beta-D-galactoside). A variety of other operons, such as 
/rp, pro, etc., are under different controls. 

Specific initiation signals are also required for efficient gene 
transcription and translation in procaryotic cells. These transcription and 
translation initiation signals may vary in "strength" as measured by the quantity of 

1 5 gene specific messenger RN A and protein synthesized, respectively. The DNA 
expression vector, which contains a promoter, may also contain any combination 
of various "strong" transcription and/or translation initiation signals. For instance, 
efficient translation in E, coli requires a Shine-Dalgamo ("SD") sequence about 7- 
9 bases 5' to the initiation codon ("ATG") to provide a ribosome binding site. 

20 Thus, any SD-ATG combination that can be utilized by host cell ribosomes may 
be employed. Such combinations include but are not limited to the SD-ATG 
combination from the cro gene or the gene of coliphage lambda, or from the £ 
coli tryptophan E, D, C, B or A genes. Additionally, any SD-ATG combination 
produced by recombinant DNA or other techniques involving incorporation of 

25 synthetic nucleotides may be used. 

The present invention also relates to anti-sense nucleic acid for 
essential cell proteins, such as replication proteins, which serve to render the host 
cells incapable of further cell growth and division. Anti-sense regulation has been 
described by Rosenberg et al., "Production of Phenocopies by Kruppel Antisense 
30 RNA Injection Into Drosophila Embryos," Nature , 313:703-706 (1985); Preiss et 
al., "Molecular Genetics of Kruppel, A Gene Required for Segmentation of the 
Drosophila Embryo," Nature , 313:27-32 (1985); Melton, "Injected Anti-sense 
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RNAs Specifically Block Messenger RNA Translation In vivo," Proc. Natl. Acad. 
Sci. USA . 82:144-148 (1985); Izant et al., "Constitutive and Conditional 
Suppression of Exogenous and Endogenous Genes by Anti-sense RNA " Science, 
229:345-352 (1985); Kim et al., "Stable Reduction of Thymidine Kinase Activity 

5 in Cells Expressing High Levels of Anti-sense RNA," CeH, 42:129-138 (1985); 
Pestka et al., "Anti-mRNA: Specific Inhibition of Translation of Single mRNA 
Molecules," Proc. Natl. Acad. Sci. USA , 81 :7525-7528 (1984); Coleman et al., 
"The Use of RNAs Complementary to Specific mRNAs to Regulate the 
Expression of Individual Bacterial Genes," CeU, 37:429-436 (1984); and McGarry 

1 0 et al., "Inhibition of Heat Shock Protein Synthesis by Heat-Inducible Antisense 
RNA," Proc. Natl. Acad. Sci. USA , 83:399-403 (1986), which are hereby 
incorporated by reference. 

Once the isolated DNA molecules encoding the CPSl protein or 
polypeptide, as described above, has been cloned into an expression system, they 

1 5 are ready to be incorporated into a host cell. Such incorporation can be carried out 
by the various forms of transformation noted above, depending upon the 
vector/host cell system. Suitable host cells include, but are not limited to, 
bacteria, virus, yeast, mammalian cells, insect, plant, and the like. In the present 
invention, the host cells may be from plants such as com, oat, grass, v^^eed, 

20 bamboo, and sugarcane. 

One aspect of the present invention involves using an inhibitor of 
the CPSl protein to interfere with the plant infection process in order to impart 
disease resistance to plants. In one mechanism, the peptide synthetase CPSl 
could be inhibited by an appropriate drug, thereby causing the plants to be 
25 resistant to fungal attack. Alternatively, the CPSl nonribosomal product could be 
degraded by an enzyme for which it is a suitable substrate, and, when the gene 
encoding this enzyme is genetically engineered into plants, the plants will become 
resistant to fungal attack. 

In this aspect of the present invention, large numbers of 
30 compounds can be screened for their activity as inhibitors of CPS 1 protein by a 
high-throughput screening assay as described in U.S. Patent No. 5,876,946 to 
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Burbaum et al., which is hereby incorporated by reference. Generally, a library of 
compounds is assayed for inhibition of an enzyme catalyzed reaction and the 
amounts of fluorescence bound to individual suspendable solid supports measured 
to determine the degree of inhibition. For example, the amount of fluorescence 

5 bound to a microbead in the presence of inhibitory compounds is greater than for 
non-inhibitory compounds. The amounts of fluorescence bound to individual 
beads are determined by confocai microscopy. Using this type of assay, inhibition 
can be determined of a peptide synthetase such as CPSl. For CPSl, the substrate 
can be amino acids (or hydroxy acids), linked at one end to the microbead and at 

1 0 the other end to a fluorescent label. The enzyme inhibitors can be utilized to 
impart fungal resistance to a variety of plants including oats, grasses, weeds, 
sugarcane, and com in particular. 

Thus, the present invention provides a method for identifying 
inhibitors of a CPSl protein , wherein said CPSl protein is a peptide synthetase 

15 of a plant pathogen. The method comprises: providing a CPS 1 protein or 
polypeptide, contacting the protein or polypeptide with potential inhibitor 
compounds, determining peptide synthetase activity, and selecting compounds 
which decrease the peptide synthetase activity. The method is especially useful in 
identifying inhibitors of a CPSl protein from plant pathogens of the genera 

20 Cochliobolus, Alternaia, Fusarium, and Pyrenophora such as those described 

hereinabove. Preferably, the method may be used to identify inhibitors of a CPSl 
protein from Alternaria solanU Fusarium graminearium, and Pyrenophora teres. 

Another aspect of the present invention involves using one or more 
of the above DNA molecules encoding a CPSl protein or polypeptide or a gene 

25 encoding an enzyme that degrades the CPSl N.R.P. product to transform plants in 
order to impart fungal resistance to the plants. This concept of pathogen-derived 
resistance, according to U.S. Patent No. 5,840,481 to Johnston and Sanford, which 
is hereby incorporated by reference, is that host resistance to a particular parasite 
can effectively be engineered by introducing a gene, gene fragment, or modified 

30 gene or gene fragment of the pathogen into the host. This approach is based on 
the fact that in any parasite-host interaction, there are certain parasite-encoded 
cellular functions (activities) that are essential to the parasite but not to the host 
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and that when one of the essential functions of the parasite such as survival or 
reproduction is disrupted, the parasitic process will be stopped. "Disruption" 
refers to any change that diminishes the survival, reproduction, or infectivity of 
the parasite. Such essential functions, which are under the control of the 
5 parasite's genes, can be disrupted by the presence of a corresponding gene product 
in the host which is (1) dysfunctional, (2) in excess, or (3) appears in the wrong 
context or at the wrong developmental stage in the parasite's life cycle. If such 
faulty signals are designed specifically for parasitic cell functions, they will have 
little effect on the host. Therefore, the procedure for making plants, for example, 

10 resistant to infection by one or more fungus involves isolating DNA coding for a 
gene such as CPSl of a fungus, operably linking the DNA within an expression 
vector, transforming the plant cell or plant tissue with the expression vector, and 
growing the transformed plant cells or plant tissue in the presence of the fungus 
such as e.g., Cochliobolus heterostrophus, Altermria solani, Fusarium 

15 graminearium, or Pyrenophora teres, where the CPSl DNA is expressed as a 
gene product and the CPS protein disrupts the essential activity of the fungi. 

Thus, the present invention provides a method of imparting disease 
resistance to a plant by over-expressing a CPSl polypeptide in the plant, wherein 
the polypeptide has protein synthetase activity. In praticing this aspect of the 

20 invention, the plant may be any plant in which it is desired to impart disease 

resistance. Thus, the plant may be an agrigultural crop or ornamental plant. The 
plant may be herbacious or woody. The plant may be a monocot or dicot. 
Examples of plants which may be used in practicing the present invention, include 
but are not limited to, com, oats, grasses, weeds, sugarcane, barley, wheat, rice, 

25 tomato, potato, citrus, malus, rye, cotton, brassica, cabbage, and carrot. Many 
other plants may also be used in the practice of the present invention. As a 
guideline, plants which serve as hosts for Cochliobolus sp., Fusarium sp., 
Alternaria sp., and Pyrenopherasp., may he used. Reference to host plants may 
be conveniently made to Fungi on Plants and Plant Products in the United States, 

30 David, F. Farr et al. editors, American Phytopathological Society Press, St. Paul, 
Minnesota, 1989. 

SUBSTITUTE SHEET (RULE 26) 



wo 01/38489 



-48- 



PCTAJSOO/32227 



In another preferred embodiment, the CPSl peptide is from the 
genera Cochliobolus. Alternaria. Fusarium .or Pyrenophora. In an even more 
preferred embodiment, the CPSl gene is from Cochliobolus heterostrophus, 
Alternaria solani, Fusarium graminearium or Pyrenophora teres. 

5 Promoters and other regulatory regions which function in plants are 

well known and include e.g., constitutive promoters, inducible promoters, 
temporally regulated and tissue specific promoters. Examples of constitutive 
promters include e.g., actin, CAMV 35S, MAS, ubiquitin, rice cyclophilin, maize 
H3 histone, and actin 2. Examples of tissue specific promoters include e.g., leaf 

1 0 specific promoters such as the RuBisCo ssu. Cab (chlorophyll a/b/binding) 

protein, and the AldP gene promoter from rice (Kagaya et al., 1995 Molecular and 
General Genetics 248.66i-674. Examples of root specific promoters include e.g., 
beta tubulin (Oppenheimer et al. Gene 65:87, 1988), and SbPRPl (Suzuki et al., 
Plant Mol Biol 27:109-1 19. 1993. Thus, using well known methods and widely 

1 5 available regulatory sequences, the skilled artisan is direct expression of a subject 
CPS nucleic acid molecule in a plant. 

To provide regulated expression of a CPS gene of the present 
invention, plants are transformed with a vector which replicates in a plant cell and 
which have a promoter which directs expression of the CPS gene product in the 

20 plant. Methods of plant transformation are well known in the art. A vector 
comprising a subject nucleic acid molecule coding for a CPS gene or fragment 
thereof may be inuoduced into a plant by leaf disk transforamtion-regeneration 
procedure as described by Horsh et al. (1985) Science 227:1229-1231. Other 
methods of transformation such as protoplast culture (Horsh et al. 1984 Science, 

25 223:496; DeBlock et al. (1984) Embo J. 2:2143; Barton et al. (1983) Cell, 
32:1033) may also be used and are within the scope of this invention. 

In transforming dicot plant species, plants may be transformed 
Agrobacterium-demtd vectors such as those described in Klett et al. (1987) 
Annu. Rev. Plant Physiol., 35:467. Other well known methods are available to 
30 insert the subject CPS genes into plant cells. Such alternative methods include 
biolistic approaches (Klein et al. 1987, A^o/ure: 327:70), electroporation. 
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microinjection (Potrykus and Spangenberg eds., Gene Transfer to Plants, Springer 
Verlag, Berlin , 1995), chemically-induced DNA uptake, the use of viruses or 
pollen as vectors, liposome mediated transformation, transformation using 
wounded or enzyme-degraded immature embryos, or wounded or enzyme 
5 degraged embryonic callus. 

For transformation of monocot plant species, a number of well 
known methods may be used such as biolistic methods (Tang, K. W. et al. 2000, 
"Acta Biotechnologica. 20(2): 175-183; de-Villiers, S. M. et al., 2000, South 
African Journal of Plant and Soil. 17(1): 50- 53) protoplast transformation, 
1 0 electroporation of partially permeabilized cells, and introduction of DNA using 
glass fibers, etc. Agrobacterium may also be used (Raineri, D. M., et al., 1990 
BioATechnology Vol. 8 :33-38). 

As used herein, "transformation" refers to the transfer of an 
exogenous nucleic acid molecule into a host cell. The nucleic acid molecule may 
1 5 be stably or transiently introduced into the host cell and may be maintained non- 
integrated for example, as a plasmid, or ahematively, may be integrated into the 
host genome. The resulting transformed plant cell(s) may then be used to 
regenerate a transformed plant via standard methods. 

The invention is further illustrated by the following examples 
20 which are not intended in any way to limit the scope of the invention. 

EXAMPLES 

EXAMPLE 1 - Materials and Methods for Examples 2- 7 

Strains, Media, Crosses and Transformation. C4 {Toxl'^\ MAT- 
2) and C5 {Taxi", MAT-]) are member of near-isogenic C heterostrophus strains 

25 (Leach et al., 1982, which is hereby incorporated by reference). R.C4.2696 
{Tox^\ MAT-2\ hygB^) is a C4-derived mutant generated using the REMI 
mutagenesis procedure (Lu et al., "Tagged Mutations at the Taxi Locus of 
Cochliobolus heterostrophus Using Restriction Enzyme-Mediated Integration," 
Prnc. Natl. Acad. Sci. USA , 91:12649-12653 (1994), which is hereby incorporated 

30 by reference). Strains 1301R33 (Jox- MAT-2; hygB\ 1301R45 (7bx-; MAT-1\ 
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hygBl^). 1301R26 (Tox-^ ; MAT-2; /r)/g5^)are progeny of the cross C5 X 
R.C4.2696. Culture media, including CM (complete medium), CNTX (complete 
medium with xylose instead of glucose), CMNS (CM with salts omitted), and MM 
(minimal medium) have been described, as have mating procedures (Leach et al., 

5 1 982; Turgeon et al., "Transfomiation of the Fungal Maize Pathogen 

Cochliobolus heteroslrophus Using the Aspergillus nidulans amdS Gene," MoL 
Gen. Genet. . 201 :450-453 (1985), which are hereby incorporated by reference). 
All strains were grown at 24°C under the warm white light or black light 
(F40/350BL) (Sylvania Inc., Danvers, MA). Ascospore germination was done at 

10 32°C in the dark for 3 days. REMI transformants were purified by transferring the 
transfoimants from the original REM! plates to fresh CMNS medium containing 
hygromycin B (CalbiochemR) at 80 ug/ml. For conidiation, stable transformants 
were transferred to CMX containing the same drug but at a higher concentration 
(120 ug/ml) to compensate for reduced drug activity due to the inhibition by the 

1 5 salts in the medium. Single conidia were picked up under a dissecting microscope 
and grown on CMNS hygromycin B plates; stable colonies were then transferred 
to individual CMX/ hygromycin B plates. All purified transformants were stored 
at -70°C in CM liquid medium containing 25% of glycerol in 96-well microtiter 
dishes. 

20 Bioassays. Fungal strains were grown on CMX plates (100 X 

1 5mm) for 7- 1 0 days at 24''C under the light for maximum conidiation. To verifiy 
normal T-toxin production by a race T isolate, 1.0 ml of T-toxin-sensitive E. coli 
(DH5a) cells were evenly spreaded on LB medium containing ampicillin (100 
ug/ml) and the plates were allowed to air dry for 30 min in a laminar hood. Agar 

25 plugs bearing fungal mycelia were inoculated (upside down) onto the E. coli cell 
lawn and the plates were incubated at 32°C. Wild type race T and race O were 
used as controls for each assay plate. T-toxin-producing strains of the fungus will 
inhibit growth of the E. coli cells and produce halos. Toxr mutants can be 
distinguished from wild type by failure to produce a halo (tight) or by production 

30 of halos smaller (leaky) or larger than wild type (overproducing). All Tox' 
mutants were transferred to Fries medium (Pringle et al.. "The Isolation of the 
Toxin of Helminthosporium victoriae" Phytopathology. 47:369-371 ( 1957), 
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which is hereby incorporated by reference), which optinriizes toxin production, and 
retested. T-cytoplasm com plants (inbred W64A) are used to verify the Toxr 
mutants identified from the E. coli assay using the procedure described below. 
Mutants defective in T-toxin production fail to produce typical race T symptoms 
on T-com. Pathogenicity phenotype on N-cytoplasm com and virulence of Tox^ 
strains to T-cytoplasm com were detemiined by a plant assay where about 3,000 
transformants generated using the REMI mutagenesis procedure (Lu et al., 
"Tagged Mutations at the Toxl Locus of Cochliobolus heterostrophus Using 
Restriction Enzyme-Mediated Integration," Proc.Natl. Acad. Sci.USA , 
91:12649-12653 (1994), which is hereby incorporated by reference) were 
screened for mutants defective in ability to cause disease on com plants. Two 
week old N-cytoplasm com plants (inbred W64A) grown in the green house (5-6 
plants in one 4" X 6" pot) were inoculated with 5 ml conidial suspensions (10^ 
conidia/ml) using a pressurized Preval Spray Gun Power Unit thin layer 
15 chromatography sprayer (Alltech Associates, Deerfield, IL), incubated in the mist 
chamber for 24 hours (23''C) and then taken to the growth chamber (23"C, 80% 
humidity, 14 hours of light). The mutant phenotypes were determined by 
occurrence of apparent variations in disease symptom development, mainly by 
lesion size comparison. Mutants producing lesions smaller than wild type were 
20 retested and lengths of typical lesions from each mutant were compared with wild 
type 7 days after inoculation and measurements were taken for statistical 
evaluation. 

DNA manipulations and sequencing. Genomic and plasmid DNA 
preparation, restriction enzyme digestions, gel electrophoresis and gel blot 

25 analysis were done using standard protocols (Sambrook, et al.. Molecular 

rinning: A Laboratorv Manual . 2"" Ed., Cold Spring Harbor, New York:Cold 
Spring Harbor Laboratory Press (1989), which is hereby incorporated by 
reference). DNA was sequenced at the Cornell DNA Sequencing Facility using 
TaqCycle automated sequencing with DyeDeoxy terminators (Applied 

30 Biosystems, Foster City, CA). pUCATPH was used for subcloning (Table 1 ). 
Primers used for sequencing (Table 2) were designed using Primer Select 
(DNASTAR Inc., LaserGene system) and synthesized by the Cornell 



SUBSTITUTE SHEET (RULE 26) 



wo 01/38489 



PCTAJSOO/32227 



-52- 

Oligonucleotide Synthesis Facility. Sequencing of each plasmid clone was 
initiated with vector-specific primers or primers designed to previously 
determined sequences. Sequences obtained were analyzed using the same system 
and nucleotide or protein database searches were performed with the BLAST 
5 program (Altschul et aL, "Basic Local Alignment Search Tool " J. Mol. Biol.. 
215:403-419 (1990), which is hereby incorporated by reference). 
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Table 1. Transformation vectors and clones used 



Plasmid 



Length 



Characteristics 



Reference 



See Figure 29. 

Cloning vector, same as pUCATPH but Figure 8 

lacking a 420 bp Narl fragment containing 
the Hindlll site 

A clone containing pUCATPH recovered from Figure 2 
the tagged site in mutant R.C4.2696 by 
religation of fi^/II-digested genomic DNA 
As above but with Afrcl-digested genomic Figure 2 
DNA 

As above but with Sflcl-digested genomic Figure 2 
DNA 

Narl fragment derived from 2 1 4S 1 Figure 4 

containing a 0.8 kb Norl-Sacl fragment of 
genomic DNA ligated to pUC18 
Vector for targeted integration constructed Figure 4 
by ligating ///wdlll-digested pUCATPH into 
the///rtdIIIsiteofp214SlN 

Vector for targeted integration constructed Figure 6 
by ligation of a 2.2 kb Sad fragment of 
pllSBUintotheS'ad site of pUCATPH 
Vector for targeted integration constructed Figure 9 
by ligation of a 0.8 kb Sspl fragment of 
pi 1 8BC4 into the Sspl site of pUCATPHN 
A clone recovered from the p2 1 4SNP Figure 5 

integration site in transformant #1 18 by 
ligation of a 5g/II-digested genomic DNA 
fragment containing the entire vector 
A clone recovered from same site as above Figure 5 
but by ligation of a fic/I -digested genomic 
DNA fragment containing part of vector 
(214SNP) sequence 

A clone recovered from the p 11 8BSP Figure 7 

integration site in transformant #9 by ligation 
of aP^rt-digested genomic DNA fragment 
containing pUC18 

A clone recovered from the pi 1 8BCS Figure 9 
integration site in transformant #12 by 
ligation of a /^/wdlll-digested genomic DNA 
fragment containing the entire vector. 

a. An underlined kb number indicates that the plasmid carries genomic DNA 
5 sequences (see related figures for details). 



pUCATPH 5.1 

pUCATPH 4.6 
N 

p214B7 92 



p214Ml 62 

p214Sl 93 

p214SlN 33 

p214SNP M 

pllSBSP 73 

pllSBCS 14 

pll8B14 10.4 

pll8BC4 67 



p9P2 



7.3 



pl2H6 8£ 
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Table 2, Primers used for sequencing recovered genomic DNA flanking the REMI 
insertion site at the R.C4 2696 mutation. 



Name' 


Position** 


Sequence' 


Plasmid** 


Origin* 


M13RMT 




GCGGATAACAATTTCACACAGGA 
SEQ. ID. No. 4 


A 


pUC18 


1. RPlb 


775 


AGGCCCAGCTGCTTCTCTTG 
SEO ID No 5 


A 


214B7TrpC 


2, RP2 


604 


ACTCGGACCGGACGGAATAACAA 
SEQ. ID. No. 6 


A 


214B7RPlb 


3. RP3 


119 


CGGAAGGAGTGCGAACAA 
SEO ID No 7 


A 


214B7RP2 


4. RP4 


-232 


GCTGCTTGCATCTGGTCTTG 
^FO ID No S 


A 


214B7RP3 


5. RP5 


- -812 


AGACCCAGCTGTTGCCCATTG 
cpO [H Nn Q 


A 


214B7RP4 


6. RP5b 


-1215 


CGGAGACGCAAAGCCTGAGA 


A 


214B7RP4 


7. RP6 


-1392 


TGCCAGCTGCGTCCAAGAAG 


A 


214B7RP5 


8. RP7 


-1839 


GCTAGCATGGCCCTCACAC 
<;Fn in No 12 


A 


2I4B7RP6 


TrpC 




TGTGTTGACCTCCACTAGCTC 


A 


PUCATPH 




<;Fn ID No n 






9. FPl 


1885 


CTACGGGATGCAGAGGGAAAGT 
^FO ID No 14 


A 


2l4B7TrpC 


10. FPlb 


1828 


GCCATGATTAGCACGATACCC 
SEO ID No 15 


B 


214B7TrpC 


11. FP2 


2028 


CGCGCTGCATACAACTACCAA 
SEQ. ID. No. 16 


B 


214MlFPlb 


12. FP3 


2490 


TGGTGGCACTACAAAGAACA 
SEQ. ID. No. 17 


C 


214M1FP2 


13. FP4 


2949 


CAGCGTGTTGAATGGGTAGAA 
SEQ. ID. No. 18 


C 


214S1FP3 


14. FP4B 


2745 


CTGGGTAGATTCGCCTTCAC 
SEQ, ID. No. 19 


c 


214S1FP4 


15. FP5 


3421 


GAGCGATCAGTGCAGAACATT 
SEQ. ID. No. 20 


c 


214S1FP4 


16. FP6 


3948 


CGCTGACGTTTGACCATCTGA 
SEQ. ID. No. 21 


c 


214S1FP5 


17. FP7 


4411 


GCATATGCAACGAGTCAAA 
SEQ. ID. No. 22 


C,D 


214SIFP6 


18. FP8 


5035 


ACGGTGCACCTGTTGATA 
SEQ. ID, No. 23 


D 


118B14FP7 


19. FP9 


5457 


ATGCGCACAATAGCCCAGTA 
SEQ. ID. No. 24 




118BC4FP8 


20. RP48 


2865 


TTCAAGCAACTGTGGCGTAGG 
SEQ. ID. No. 25 


D 


214S1FP6 


21.FP10 


5790 


GATCCTAGCGACCGCACACCAAC 
SEQ. ID. No. 26 


F 


9P2FP9 


22. FPU 


6327 


CCTGCTGCTGGTGCTTCT 
SEQ. ID. No. 27 


F 


9P2FP10 
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23. FPUb 


6211 


GAGTTGCAAATCGTGACAGC 


F 


9P2FPI0 






SEQ. ID. No. 28 






24. FP12 


6457 


TATCAGCTGTTGTTCAATGTTCTA 


F 


9P2FP!I 






SEQ. ID. No. 29 






25. FP13 


6854 


TGTTATCCCATTGCCATTG 


F 


9P2FP12 






SEQ. ID. No. 30 






26. FP14 


7400 


AAGGACGGAGATTGGTGGAG 


F 


9P2FP13 






SEQ. ID. No. 31 




9P2FP14 


27. FP15 


7771 


GGAGATGGCGGTGACGA 


F 






SEQ. ID. No. 32 




9P2FP15 


28. FP16 


8145 


GCATGGCTTGTGGAGGAC 


F 






SEQ. ID. No. 33 




9P2FP16 


29. FPI7 


8492 


AGATTGTGGCTAGTATGGAGGTAA 


F 






SEQ. ID. No. 34 




pUC18 


M13F40 




GTTTTCCCAGTCACGAC 


G 






SEQ. ID, No. 35 




9P5M13F4 


30. RPI 


8953 


TACTACTAGCATACCAGCATACCT 


G 






SEQ. ID. No. 36 






31. RP2 


8559 


TCAACCTCGGAATACCAAGTC 


G 


9P5RP1 






SEQ. ID. No. 37 







a. "RP" indicates reverse primer; "FP" indicates forward primer. Primers 
designed to genomic DNA sequences are numbered in order. For stock tube and 
5 the notebook, Primers 1-17 have a leading nxmiber "214"; 18-20 with "118"; 21-29 
with "9P2" and 30-3 1 with "9P5". M13RMT(a M13R mutant version; there is a 
mutation in the polylinker of pUC18) and M13F-40 are provided by Cornell DNA 
Sequenceing Facility. TrpC primer site is in the pUCATPH TrpC promoter region 
38 bp from SaR site with sequencing direction from SaR to Kpril, 
10 b. The position of the first base of each primer corresponds to the assembled 

sequence {CPSl + TESl, total 1 1.3 kb found in Lu's folder in the computer Yoder 
labl) 

c. Each primer sequence is given in the 5' to 3' direction 

d. Plasmids used as templates for each sequencing reaction. A = p214B7; B = 
15 p214Ml; C = p214Sl; D = pi 18B14; E = pll8BC4; F = p9P2, G = p9P5 (=9P2) 

e. Original sequences that were used for primer design can be found in the CPSl 
sequence notebook or in Shunwen Lu's folder (CPSl sequence) in the computer 
Yoder labl under the same names as listed. 

20 Recovery of tagged DNA from the REMI insertion site and 

targeted gene disruption. Genomic DNA of mutant R.C4.2696 was digested with 
BgRl, Mscl (no sites in pUCATPH) or Sad (which cuts the vector once) and 
purified by phenol extraction and ethanol precipitation, then dissolved in TE (pH 
8.0). Ligation was performed in 50 ul reaction mixture, containing 1 x T4 DNA 

25 ligase buffer with 10 mM ATP, 60 units 14 DNA ligase (New England Biolabs, 
Beverly, MA) and 3 ug of 5g/II-digested genomic DNA, at 14°C overnight. 10 ul 
of ligation mixture was used to transform 200 ul of competent DH5a cells, 
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prepared using the calcium chloride treatment (Sambrook, et al., Molecular 
Cloning: A Laboratory Manual, 2"^ Ed., Cold Spring Harbor, New York:Cold 
Spring Harbor Laboratory Press (1989), which is hereby incorporated by 
reference), to ampicillin resistance. Ampicillin resistant clones were analyzed by 

5 digestion of plasmid DNA with several diagnostic restriction enzymes and clones 
containing the REMI vector plus flanking genomic DNA were sequenced using 
the vector-specific primers (M13R or TrpQ, Three plasmids, p214B7, p214Ml 
and p214S 1 (Figure 2) were recovered and used for sequencing. For targeted gene 
disruption in wild type, p214B7 was amplified and plasmid DNA purified by 

1 0 equilibrium centrifugation in CsCl-ethidium bromide gradients (Sambrook, et al., 
Molecular Cloning: A Laboratorv Manual , 2"'* Ed., Cold Spring Harbor, New 
York:CoId Spring Harbor Laboratory Press (1989), which is hereby incorporated 
by reference). 30 ug of plasmid DNA (linearized with BgRl for double crossover 
integration as illustrated in Figure 3) were used to transform wild type and the 

1 5 transformants were purified by isolation of single conidia, assayed for 
pathogenicity and characterized by gel blot analysis. 

Sequence extension by targeted integration and plasmid rescue. 
Two overlapping cosmid clones were isolated by probing a genomic DNA library 
of C4 constructed on a cosmid vector, but both extended into the left region only 

20 of p2 1 4B7. To extend to the right, a chromosome walking strategy was employed 
(Figure 3). Three targeted gene disruption experiments (each followed by plasmid 
rescue) were done successively. In the first experiment, a vector was constructed 
as follows (see Figure 4 for details): p214Sl was digested with Narl and religated 
to create p214SlN, which was then digested with HindlW and ligated into the 

25 ///VzdIII site of pUC ATPH to create p2 1 4SNP for transformation of race O (C5). 
One transformant (Txl 18) resulting from homologous integration (confirmed by 
gel blot analysis) was used for plasmid rescue as described above. Two new 
plasmids pi 1 8B 14 and pi 18BC4 were recovered, both of which carry sequence at 
the 3' end but only 172 and 680 bp more than p214S 1 , respectively (Figure 5). To 

30 continue the walk, pll 8B 14 was digested with Sad and ligated into the Sac\ site 
of pUCATPH to create pi 18BSP (Figure 6). This vector was transformed into 
wild type and one plasmid, p9P2 was recovered (from transformant Tx9), which 
extends 4.4 kb into the region 3* of pi 18BC4 and contains the 3' end of CPSl 
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(Figure 7). A third experiment was done in an attempt to recover a 1 5 kb Xhol 
fragment at the 3* end of that tagged gene, pi 18BCS (Figure 8) was constructed 
by subcloning a 0.8 kb Sspl fragment into the same site of pUCATPHN. Plasmid 
rescue using Xhol digested-genomic DNA of a transformant (TX12) failed to 
5 recover the 1 5 kb Xhol fragment, but pi 2H6 was recovered using M/idlll-digested 
genomic DNA of the same transformant; the genomic DNA matched that already 
cloned on p9P2 (Figure 9). 

EXAMPLE 2 " Characterization of the REMI mutant. 

10 1 ) In all culture conditions used in the lab, mutant R.C4.2696 

grows just like wild type with no variations in growth rate, color and 
morphological features (Figure lOA). It produces normal conidia that germinate 
and form infection structures like wild type when induced on artificial surfaces 
(Figure lOB) and shows normal mating ability when crossed to wild type testers. 

1 5 No pleiotropic phenotypes associated with the mutation have been detected so far. 

2) The mutant differs from wild type in the ability to cause disease 
on com plants. When tested on T-cytoplasm corn, the mutant produces race T 
type symptoms but the disease develops more slowly than with wild type although 
it produces wild type levels of T-toxin as detected in a microbial assay (Figure 

20 11), suggesting that the reduced virulence is not related to a deficiency in the 

ability to produce T-toxin. This is clearer on N-cytoplasm corn where the mutant 
produces lesions significantly smaller than those produced by wild type (Figure 
12). When the mutant was crossed to a wild type race O tester, the small lesion 
phenotype and ability to produce T-toxin segregated independently, indicating that 

25 mutant phenotype is not associated with the reduced fitness trait tightly linked 
with the Toxl locus (Klittich et al., "Reduced Fitness Associated With Toxl of 
Cochliobolus heterostrophusr Phytopathology . 76:1294-1298 (1986), which is 
hereby incorporated by reference). The statistical evaluation of lesion size in the 
race O genetic background indicates that the mutation causes 60% reduction in the 

30 fungal virulence to com plants (Figure 1 3). 

3) The mutant phenotype is caused by a tagged, single site 
mutation. In crosses between the mutant and wild type testers, progeny 
segregated 1 : 1 for parental types only and all hygromycin B-resistant progeny 
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produced lesions similar to the mutant parent; all hygromycin B-sensitive progeny 
produced wild type lesions (Figure 14), indicating that a tagged mutation is 
responsible for the reduced pathogenicity of the mutant. 

5 F.XAMPLE 3 ~ Cloning and sequencing ofDNA flanking the REMI 

vector insertion site. 

A total of 1 1 .3 kb of genomic DNA surrounding the insertion site 
was cloned and completely sequenced (Figure 15). The sequence was derived 
from seven plasmid clones. The first three (p214B7, p214Ml and p214Sl) were 

1 0 recovered from the tagged site in mutant R.C4.2696 and cover about 60 % (6.6 
kb) of the entire region. The rest (pi 18B14. pi 1 8BC4, p9P2 and pl2H6) were 
recovered from transformants generated using the chromosome walking strategy. 
DNA to the left of the insertion site (3.4 kb) was cloned on p214B7; DNA on the 
right (7.9 kb) was cloned on different overlapping plasmids. p9P2 carries the 

1 5 largest amount (4.6 kb) including genomic DNA on pl2H6 (Figure 1 5). 

EXAMPLE 4 - Jdentiflcation ofCPSl and TESl at the sequenced region. 

Analysis of the combined sequences revealed two open reading 
frames (ORFs). 0RF1(5.4 kb) starts 576 bp upstream of the REMI vector 
20 insertion site and ends with an in-frame stop codon (TAG) 3029 bp from the end 
of the sequenced region in the right flank (Figure 15). No "TATA" box-like 
element is found in the expected position, but five putative "CAAT" boxes are 
located upstream of the start codon (ATG), three of them are in the range found in 
most filamentous fungal promoters (60-200 bp) (Gurr et al., 1987, which is hereby 

25 incorporated by reference). Sequence around ATG of ORF 1 (C ACCATGCT 
(SEQ. ID. No. 38)) is similar to the fungal consensus (CACCATCGC (SEQ. ID. 
No. 39)). Although there are several ATGs found upstream, they are less likely to 
be used as a start codon because the surrounding sequences lack similarity to the 
consensus. Three putative introns are identified by their conserved 5' and 3" 

30 border sequences and potential branch sites (Table 3). Splicing these introns 

eliminated stop codons which would otherwise interrupt the 5.4 kb open reading 
frame. Three introns have similar size (45-53 bp respectively) which is in the 
range of intron size determined from most fungal genes. A putative 
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polyadenylation signal (ATAA) is found 223 bp downstream of the translation 
termination site. The G+C content of ORFl is 51 .5%, which is similar to most 
Cochliobolus genes (Turgeon et al., "Cloning and Analysis of the Mating Type 
Genes from Cochliobolus heterostrophus,'" Mol. Gen. Gene. . 238(1 -2):270-284 

5 (1 993); Van Wert et al, "Structure of the Cochliobolus-heterostrophus 

Glyceraldehyde-3-Phosphate Dehydrogenase Gene " Curr. Genet. , 22(l):29-35 
(1992); Yang et al., *'A Polyketide Synthase is Required for Fungal Virulence and 
Production of the Polyketide T-Toxin," Plant Cell , 8(1 1):2139-2150 (1996); Rose 
et al., "A Decarboxylase Required for Polyketide Toxin Production and High 

1 0 Virulence by Cochliobolus heterostrophus,'" 8^ Int. Svmp . Mol. Plant-Microbe 
Int. , Knoxville, p. J-49 ( 1 996), which are hereby incorporated by reference). 
Interestingly, ORFl is flanked by two regions of G+C rich DNA. The first (1 .4 
kb, 60.7% G+C) is found between ORFl and 0RF2; the second (1.2 kb, 60.3% 
G+C) is found 1 .8 kb downstream of the stop codon of ORFl (Figure 16). 

1 5 Database searches using the translated protein sequence of ORFl revealed high 
similarity to SafB, one of the multifunctional enzymes catalyzing the biosynthesis 
of the cyclic peptide antibiotic saframycin Mxl produced by the bacterium 
Myxococcus xanthus (Pospiech et al., "Two Multifunctional Peptide Synthetases 
and an 0-methyltransferase are Involved in the Biosynthesis of the DNA-Binding 

20 Antibiotic and Antitumour Agent Saframycin Mxl from Myxococcus xanthus,'" 
Microbiology . 142(4):74 1-746 (1996), which is hereby incorporated by 
reference). The entire nucleotide sequence of ORFl (designated CPSl ) is given 
in Figure 17. 

Table 3. Characteristics of putative introns in CPSl and TESl 
25 



Gene 


Intron 


Size(bp) 


Location 


5* Border 


3 'Border 


Branch Site 


CPSl 


I 


45 


3060-3105 


GTAAGT 


TAG 


GTCTAAC 




H 


51 


4532-4582 


GTAAGT 


GAG 


TGCTAAC 




III 


53 


5187-5239 


GTACGT 


GAG 


TACTAAC 


TESl 


I 


49 


528-566 


GTAAGT 


TAG 


CCTTAAG 


Consensus 








GTA^/CGT 


T/cAG 


YNCTAAC* 



SUBSTITUTE SHEET (RULE 26) 



wo 01/38489 



PCTAJSOO/32227 



-60- 



♦ Y = Pyrimidine (T or C); N = purine or pyrimidine. 

0RF2 starts about 1 .6 kb upstream of the start codon of CPSl and 

5 is transcribed in the opposite direction (Figure 15). No "TATA" box-like element 
and CAAT box are found; instead, an AT-rich sequence " AAAACTAT" (SEQ. 
ID. No. 40) is located 1 1 bp upstream of the start codon ATG and a CT motif is 
found in the -30 region, which is characteristic of a number of fungal genes that 
lack a CAAT box in their promoter region (Gurr et al., "The Structure and 

10 Organization of Nuclear Genes of Filamentous Fungi," in Kinghorn, ed.. Gene 
Stn.r.ti.re in Eukarvotic Microbes . Vol. 22, published by the Society for General 
Microbiology, Oxford, England:IRL Press, pp. 93-140 (1987), which is hereby 
incorporated by reference). The sequence around ATG matches perfectly fungal 
gene consensus. A putative intron (50 bp) is found in the middle of 0RF2 with 

1 5 conserved 5' and 3' border sequences and a potential branch site (Table 3). A 
putative polyadenylation signal (AAATA) is found 189 bp downstream of the 
translation stop codon TGA. The G+C content of ORF2 is 55.5%, which is 
slightly higher than the normal range because the 5' end of 0RF2 is located in the 
region of G+C rich DNA upstream of ORFl (Figure 16). Database search 

20 revealed that 0RF2 encodes a protein with high similarity to Homo sapiens 
thioesterase II (hTE, Liu et al., "Binding of HIV-1 Nef to a Novel Thioesterase 
Enzyme Correlates with Nef-Mediated CD4 Down-Regulation," J. Biol. Chem. , 
272(21:13779-13785 (1997), which is hereby incorporated by reference) and E. 
coli thioesterase II encoded by the tesB gene (Naggert et al., "Cloning, 

25 Sequencing and Characterization of Escherichia-coli Thioesterase II," J- Biol. 
Chem. . 266(17)11044-1 1050 (1991), which is hereby incorporated by reference). 
The nucleotide sequence of 0RF2 (designated TESl) is given in Figure 18. 

EXAMPLE 5 ~ Modular structure of CPSL 

30 Predicted CPSl protein (1743 amino acids, Mr 193235) contains 

two structurally similar modules, both of which are similar to SafBl, the first 
module of saframycin synthetase B (overall 25% identity; 50% similarity) and 
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have apparent amino-acid-activating and thiolation domains but lack 
methyltransferase activity, thus appearing to be typical type I modules (Figure 
19). The number of amino acids in each module is different: the first module 
(CPSl A) consists of 574 amino acids (from the first residue of core 1 to the last 
5 residue of core 6), which is larger than most type I modules; the second module 
(CPS IB) has 530 amino acids, which is average. The distance between the two 
modules is 193 amino acids, much shorter than most peptide synthetases (500-600 
aa), but this distance is not highly conserved, i.e., an opposite variation is found in 
HC-toxin synthetase and cyclosporine synthetase, both of which have about 1,000 

1 0 aa between the first and second amino-acid-activating module (Figure 20F). 

Amino acid alignment of the two modules of CPSl to SafBl 
indicated that these modules are highly similar to each other in both overall amino 
acid composition and conserved motif sequences as defined by Stachelhaus and 
Marahiel (Stachelhaus et al., "Modular Structure of Peptide Synthetases Revealed 

1 5 by Dissection of the Multifunctional Enzyme GrsA/' J. Biol. Chem. . 

270(1 1):61 63-6169 (1995); Marahiel, "Protein Templates for the Biosynthesis of 
Peptide Antibiotics," Chem. Biol. . 4(8):561-567 (1997), which are hereby 
incorporated by reference). When aligned to other bacterial or fiingal peptide 
synthetases, CPSl only showed local similarity to cyclosporine synthetase (Weber 

20 et al., "The Peptide Synthetase Catalyzing Cyclosporine Production in 

Tolypocladium niveum is Encoded by a Giant 45.8-Kilobase Open Reading 
Frame,'' Current Genetics . 26(2): 120-125 (1994), which is hereby incorporated by 
reference) and tyrocidine synthetase A (Mootz et al, "The Tyrocidine 
Biosynthesis Operon of Acillus brevis: Complete Nucleotide Sequence and 

25 Biochemical Characterization of Functional Internal Adenylation Domains," J. 

Bacteriol. , 179(21):6843-6850 (1997), which is hereby incorporated by reference), 
but when the amino acids in motif regions were aligned, a overall conservation 
was observed. Both CPSIA and CPSIB have all five core sequences in the 
amino-acid-activating domain (Figure 20A-E). Cores 3 and 4 are well conserved 

30 except for the replacement of an aspartic acid residue of core 4 by a leucine in 
CPSIA. Cores 1 , 2 and 5 show weak conservation, but similar variations are also 
seen in SafBl. A thiolation domain is found in both modules, which contains a 
highly conserved motif (core 6, Figure 20F). The serine residue in this motif has 
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been shown to be the active site for 4'-phosphopantetheine attachment 
(Schlumbohm et al, "An Active Serine is Involved in Covalent Substrate Amino 
Acid Binding at Each Reaction Center of Gramicidin S Synthetase," J. Biol. 
Chem. ,266r34):23135-23141 (1991); Stein et al., "Detection of 4'- 

5 Phosphopantetheine at the Thioester Binding Site for L- Valine of GramicidinS 
Synthetase 2," FEBS Lett. . 340(1 -2):39-44 (1994), which are hereby incorporated 
by reference). The distances between the six core sequences in the two modules 
are also largely conserved. Two exceptions are found in the first module, which 
has 312 aa between cores 2 and 3, larger than normal (150-200); 61 between cores 

10 5 and 6, only half of that of most peptide synthetases. SafBl also shows distance 
variations at these two interval regions (Figure 20B and E). In addition to amino- 
acid-activating and thiolation domains, CPSl also has an integrated thioesterase 
domain (TE) in the carboxy-terminal end of CPSIB (Figure 19). A signature 
sequence GXSXG, which is highly conserved in animal fatty acid thioesterase 

1 5 type II enzymes and several peptide synthetases, is found in this domain (Figure 
21). 

EXAMPLE 6 - Sequence homology analysis of TESl protein. 

The predicted TESl protein consists of 367 amino acids {Mr 

20 41013). Amino acid alignment of TES 1 to hTE, TESB and Mycobacterium 

tuberculosis TESB homolog (Philipp et al., "An Integrated Map of the Genome of 
the Tubercle bacillus, Mycobacterium tuberculosis H37Rv, and Comparison with 
Mycobacterium leprae;' Proc. Natl. Acad. Sci. USA , 93(7):3 132-3 137 (1996), 
which is hereby incorporated by reference) showed that these proteins have an 

25 overall 40% identity and 60% similarity. A highly conserved VHS motif (putative 
active site) is found in the C-terminal region of TESl at a conserved position 
(Figure 22). All these thioesterases have no sequence similarity with the 
previously identified animal type I or type 11 thioesterases knovm to be involved 
in the chain termination of fatty acid synthesis (Naggert et al., "Cloning, 

30 Sequencing and Characterization of Escherichia-coli Thioesterase II," J. Biol. 

ChenL, 266(1 7): 1 1044-1 1050 (1991), which is hereby incorporated by reference). 
Interestingly, TESl has more homology to hTE than to two bacterial genes, 
suggesting that both proteins belong to a new family of eucaryotic thioesterases. 
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EXAMPLE 7 ~ Targeted disruption of CPS J. 

Disruption of either CPSl A or CPSIB restored the original mutant 
phenotype. Ten transformants from each of four individual disruption 

5 experiments using different constructs, including the plasmid recovered from the 
REMI insertion site in the mutant (p214B7) and three vectors for chromosome 
walking (p214SNP, pi 18BSP and pllSBCS) were purified and assayed onN- 
cytoplasm com. All transformants showed the same small lesion phenotype as 
that of the original REMI mutant. Gel blot analysis confirmed that all 

10 transformants showing the mutant phenotype resulted from homologous 

integration of the transforming vector that disrupted the wild type CPS] (Figures 
24-26). No transformants showing the wild type phenotype were obtained, 
presumably because of the large genomic DNA fragments (over 800 bp in all 
disruption experiments) on the transforming vector that resulted from high 

1 5 efficiency of homologous recombination and the low chance to recover 
transformants with ectopic integration. 

EXAMPLE 8 - Methods and Materials for Examples 9-1 0 

Strains, growth conditions and transformation. Strains of 

20 Cochliobolus species and relatives used for genomic DNA hybridization are listed 
in Table 4. The strain HvW, a victorin-producing isolate of C. victor iae was 
recovered from storage and grown on CMX medium (Turgeon et al., 
"Transformation of the Fungal Maize Pathogen Cochliobolus heterostrophus 
Using the Aspergillus nidulans amdS Gene," Mnl. Gen. Genet., 201 :450-453 

25 (1985), which is hereby incorporated by reference) for conidiation or on oat meal 
agar medium (Churchill et al., "Victorin-Deficient REMI Mutants oi Cochliobolus 
victoriae Demonstrate a Requirement for Victorin in Pathogenesis," Fungal 
Genet. News!.. 42A:41 (1995), which is hereby incorporated by reference) for 
victorin detection at 24''C under warm white lights (Sylvania Inc., Danvers, MA). 

30 Transformation was done using the C. heterostrophus procedure (Turgeon et al., 
"Cloning and Analysis of the Mating Type Genes from Cochliobolus 
heterostrophus," Mol. Gen. Gene.. 238(1 -2):270-284 (1993), which is hereby 
incorporated by reference). 
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Table 4. Detection of CPS I homologs in Cochliobolus spp and relatives 



Strain' 



Host" 



£coRI 
digest' 



Hybridization 
digest"* 



BgR\ 
digest* 



C. heterostrophus 
race T f C4') 
race O (C5) 

C. carbonum 
race 1 f26R13) 
race 2 (YugY) 
race 3 (BZ1703)* 

C victoriae fHvW) 

C. salivus (A20) 

C jpec//^r (D5-7) 

C. homomorphus 
(ATCC 13409) 

C. dactyloctenii 
(7938-9) 

5. /Mz-c/cfl (NK2) 

5. ro5/ra/a (32 197) 

Rjgcchgri 
(764-1) 
(1249-10) 



Com 
{Turf-13) 

Com' 
{hmlhml) 



Oats (F&) 
Grasses^ 
Grasses^ 
Unknown 



Unknown 



Sorghum and 
maize^ 

Weeds and 
bamboo^ 

Sugarcane' 



+ 
+ 

+ 
N 
N 

+ 

+ 

+ 

N 

N 



+ 
N 



5.2 3.2 
5.2 3.2 

6.6 
6.6 
6.6 

N 

3.0 

N 

5.8 

5.9 

N 

2.8 



5.4 2.5 
5.4 2.5 



4.2 
4.2 

5.0 
5.0 
5.0 

5.0 

N 

N 

N 

N 

N 

N 



N 
N 



a C = Cochliobolus. S. = Selosphaeria.B. = Bioplaris. The name of isolates (or 

5 lab strains) of each species are given in parentheses and those known to produce 
host-specific toxins are underlined. ♦ Provided by Tsukiboshi Takao (Japan) and 
the isolate could be either BZ1209 or BZ1703. . 
b Genotype susceptible to the host-specific toxin-producing isolate is given in 
parentheses. References for hosts of those species not mentioned in the previous 

10 chapters are as follows: 1 : Welz et al., "Phenotypic Variation and ParasiUc 
Fitness of Races of Cochliobolus-carbonum on Cora in North Carolina," 
Phytopathology. 83(6)593-601 (1993); Leonard et al., "Genetic Diversity m Field 
Populations of Cochliobolus-carbonum on Com in North Carolina USA. 
Phytopathology. 80(1 1): 1 1 54-1 1 59 (1990) (for races 2 and 3 only). 2: Domsch et 

15 al.. "Compendium of Soil Fungi. Vol. 1," New York, New YorkrAcademic Press, 
pp 216-222 (1980). 3: David et al., "Fungi on Plants and Plant Products, St. , 
Paul, Minnesota:APS Press, p. 635 (1989); Thakur et al.. "Characterization of a 
New Race of Exserohilum-turcicum Virulent on Com With Resistance Gene 
HTN " Plant Dis- 73(2):151-155 (1989). 4: Rao et al., "New Fungal Diseases on 

20 Some Weeds." Indian Hot. Rep. . 6(1):38 (1987); Bhat et al., "Unrecorded 
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Pathogen on Bamboo Causing Blight in India." Curr. SCI. (BANG ALORE), 
58(20):1148-1 149 (1989). 5: Yoder, "Toxins in Pathogenesis," Ann. Rev. 
Phvtopathol. . 18:103-129(1980). 

c. Genomic DNAs (from a previously prepared gel blot filter, Rose et al., "A 
5 Decarboxylase Required for Polyketide Toxin Production and High Virulence by 
Cochliobolus heterostrophus^ Int. Svmp. Mol. Plant-Microbe Int. , Knoxville, 
p. J-49 (1996) were probed with the 3.4 kb CPSl fragment cloned on p214B7 
(Figure 2). indicates a strong hybridization signal. All species hybridized to a 
large fragment (-23 kb). 
10 d. Genomic DNAs selected from a lab collection were probed with the CPSl 3.2 
kb fragment cloned on p214Sl (Figure 2). The size of fragments that hybridized 
to the probe is given in kb. The intensities of hybridization signals were similar to 
each other. N = not done. 

e. Genomic DNAs were probed with the same CPSl fragment as in c. Gel blot is 
1 5 shown in Figure 26. 

DNA manipulations and targeted disruption of the CPSl 
homolog in C. victoriae. Genomic DNAs for probing were prepared according to 
Yoder ^Cochliobolus heterostrophus. Cause of Southern Com Leaf Blight," in 

20 Sidhu, ed.. Genetics of Plant Pathogenic Fungi , Vol, 6, San Diego, 

Califomia:Academic Press, pp. 93-1 12 (1988), which is hereby incorporated by 
reference), or selected from a lab DNA collection (stored at 4°C). A gel blot filter 
bearing known genomic DNAs was also probed. Plasmid DNA preparation, 
restriction enzyme digestions, gel electrophoresis, gel blot analysis were done 

25 using standard protocols (Sambrook, et al.. Molecular Cloning: A Laboratory 
Manual . 2"** Ed., Cold Spring Harbor, New York:Cold Spring Harbor Laboratory 
Press (1989), which is hereby incorporated by reference). For probing, CPSl 
fragments of C. heterostrophus cloned on p214B7 (3.4 kb left flank) and p214Sl 
(3.2 kb right flank) (Figure 2) were prepared by restriction enzyme digestion of 

30 the plasmid DNAs followed by purification using the QIAquick Gel Extraction 
Kit (QIAGEN Inc., Chatsworth, CA). The plasmid pi 18B14, which canies the 
2.2 kb Bgl II fragment of CPSl interrupted by the hygB cassette (see Figure 5) 
was linearized with BgUl and introduced into HvW genome. Transformants were 
purified by isolation of single conidia and genomic DNAs were digested with 

35 BgUl and probed with the CPSl 3.2 kb fragment. 

Bioassays. Pathogenicity was determined by an oat plant assay. 
Fungal strains were grown on individual oat meal agar medium plates (60 X 1 5 
mm) containing hygromycin B (60 ug/ml) for 10 days at 24°C under lights. 
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Conidia were scraped from the plates and suspended in 6 ml of sterilized distilled 
water. One ml of conidial suspension of each strain was mixed with 60 seeds of 
susceptible or resistant oats. Inoculated seeds were planted in 4" X 6" pots and 
seedlings were allowed to grow for two weeks. Seed germination rate and 
5 symptom development were recorded at different stages (4, 6, 8 and 24 days after 
inoculation). Detection of victorin production using HPLC analysis was done by 
Alice Churchill in Dr. Vladimir Macko's lab at Boyce Thompson Institute for 
Plant Research. 

1 0 EXAMPLE 9 - Detection ofCPSl liomologs. 

Genomic DNAs of 12 isolates (or lab strains) of 9 fungal species 
hybridized to CPSl (Table 4). All 6 Cochliobolus species, including 4 known 
plant pathogens (C carbonum, C. vicioriae, C. sativus and C. spicifer) and 2 
species with unknown hosts (C. homomorphus and C. dactyloctenii) gave 

1 5 hybridization signals of the same intensity as that of C. he teres trophus CPSl 
fragments (Figure 26, only C. carbonum and C. victoriae are shown). Two 
phytopathogenic Setosphaeria species and Bioplaris sacchari, a sugarcane 
pathogen gave a similar hybridization intensity. 

CPS] homologs appear to be polymorphic among different species, 

20 i.e., all species gave one or two unique bands when BgUl or HindlU digested 
genomic DNAs were probed (except for C. victoriae, which showed the same 
hybridization pattern as C. carbonum) (Table 4 and Figure 26). Interestingly, 
fcoRI digested genomic DNAs of the same species did not show polymorphisms; 
all species hybridized to a large fragment (-23 kb. Table 4), indicating the absence 

25 of an £coW site in all CPSl homologs as in the C. heterostrophus gene. In C 

heterostrophus, a > 12 kb of genomic region which includes CPSl (5.4 kb), TESl 
(1.1 kb) and sequence downstream of the 3' end of CPSl has no £coRI sites. In 
contrast to species-dependent polymorphisms, CPSl homologs appear to be 
highly conserved among different isolates of the same species. Both C 

30 heterostrophus race T and race O hybridized to the same 4.2 kb BgUl fragment (or 
5.2 and 3.2 kb //mdlll fragments); all three C. carbonum races hybridized to the 
same 5.0 kb Bglll fragment (or 6.6 kb HindlU fragment) (Table 4, Figure 26) and 
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B. sacchari isolates 764-1 and 1249-10 hybridized to the same //mdlll fragments 
(5.4 and 2.5 kb) (Table 4). 

FXAMPLE 10 ~ Targeted disruption ofCPSl Iwmolog in C. victoriae. 

Twenty transformants were obtained from transformation of the 
victorin-producing isolate HvW with Bg/II-linearized plasmid pi 18B14 (Figure 
5). Six transformants were purified and assayed for both victorin production and 
pathogenicity to susceptible oat plants. All transformants produced wild type 
levels of victorin as determined by HPLC analysis (Figure 27), but four of them 
(Tx7, Tx2, Tx5 and Tx8) showed dramatically reduced virulence in the plant 
assay. The seed germination rate on the eighth day after inoculation is only 13- 
25% for wild type and two transformants (Tx9 and Tx4), but 45-63% for the other 
four transformants. On day 24 after inoculation, all plants emerged from the seeds 
inoculated with wild type, Tx9 or Tx4 were killed but most (29-63%) from the 
seeds inoculated with Tx2, Tx7, Tx5 or Tx8 still survived (Table 5, Figure 28A). 
Gel blot analysis confirmed that transformants showing the reduced virulence 
phenotype resulted from homologous integration of the transforming vector that 
disrupted the wlld.type CPSl homolog in C. victoriae genome; transformants 
showing the wild type phenotype resulted from ectopic integration events that left 
the native gene intact (Figure 28B). AH transformants remained nonpathogenic to 
resistant oats, indicating that disruption of the CPSl homolog does not affect host 
specificity of the fungus. 

Table 5. Disease development of oat plants inoculated with C. victoriae 
transformants (Tx). 





No. germinated" 


Germination 


No. survivors 


Strain' 


4 


6 


8 


Rate (%)' 


24 % 


Control-1 


28 


41 


45 


75 


75 100 


Control-2 


40 


50 


50 


83 


50 100 


Control-3 


1 


7 


12 


20 


0 0 


Tx2 


8 


26 


27 


45 


16 59 


Tx4 


5 


15 


15 


25 


0 0 
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Tx5 


2 


24 


28 


47 


8 


29 


Tx7 


14 


36 


38 


63 


24 


63 


Tx8 


7 


29 


29 


47 


13 


47 


Tx9 


0 


3 


8 


13 


0 


0 



a. Control- 1 = uninoculated susceptible oat seeds. Control-2 and Control-3 = 
resistant and susceptible oat seeds inoculated with wild type C victoriae (isolate 
HvW), respectively. Six transformants were tested on both resistant and 

5 susceptible seeds, but only data for the later are shown (all transformants gave the 
same results as Control-2 when tested on resistant seeds). Repeat experiments 
gave similar results (data not shown). 

b. Sixty oat seeds were used for each strain. Emerged oat plants were counted 4, 
6 and 8 days after inoculation. 

1 0 c. Calculation based on the data collected on the day 8. 

d. Recorded on day 24 after inoculation. The percentage of survivors is based on 
the number of plants recorded on days 8 and 24. 

EXAMPLE 11 Isolation ofCPSl genes from other plant pathogens 
1 5 As dislosed in the previous examples, the Cochliobolus 

heterostrophus gene CPSl encodes a peptide synthetase that appears to be a 
general factor for ftingal virulence to their hosts. Thus, CPSl has been found to 
be highly conserved among at least 9 fungal species belonging to 3 genera 
including the genus Cochliobolus and closely related genera Bioplaris and 
20 Setosphaeria; it has been demonstrated to be required for pathogenesis of three 
different plant pathogens i.e., C heterostrophus race O , race T to com and C 
victoriae to oats (Lu, 1998, "Molecular-genetic analysis of general and specific 
pathogenesis factors in Cochliobolus heterostrophus,'' Ph.D thesis, Cornell 
University). 

25 To fiuther explore the role of CPSJ in fungal pathogenesis and its 

conservation in other fimgi, genomic DNAs of additional species in Cochliobolus 
and other closely or distantly related genera were- probed with ChCPSl by DNA- 
DNA hybridization (Lu, S.-W., B. G. Turgeon and O. C. Yoder. 1999. "A gene 
cluster from the com pathogen Cochliobolus heterostrophus required for 

30 nonribosomai peptide biosynthesis and general virulence of fiingi." Fungal 

Genetics Conference, March 1999, Pacific Grove, California). Genomic DNAs 
of 40 filed isolates (or lab strains) representing 34 fungal species belonging to 16 
genera hybridized when probed with ChCPSl (Figs. 30A-30C). 
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Fungal genomic DNAs were prepared according to a previously 
described procedure (Yoder, 1988. ''Cochliobolus heterostrophus, cause of 
Southern Com Leaf Blight". Genetics of Plant Pathogenic Fungi. G. S. Sidhu. San 
Diego, Academic Press. 6: 93-1 12). Plasmid DNA preparations, restriction 

5 enzyme digestions and preparation of DNA gel blots were performed following 
standard protocols (Sambrook, J., E. F. Fritsch, et al. 1989 "Molecular Cloning: A 
Laboratory Manual, 2nd Edition". Cold Spring Harbor, NY, Cold Spring Harbor 
Laboratory Press). A 3.2 kb ChCPSl fragment (corresponding to ChCPS 1 amino 
acids 173-1208) was obtained by restriction enzyme digestion of a plasmid clone 

10 p2 1 4S 1 (Lu, 1 998, Ph.D thesis), (see Figure 2), followed by purification using the 
QIAquick Gel Extraction Kit (QIAGEN Inc., Chatsworth, CA). The purified 
C/2CP57fragment was labeled with a-[^^P]dCTP (Turgeon, B. G., H. Bohlmann, 
et al, 1993. "Cloning and analysis of the mating type genes from Cochliobolus 
heterostrophusr Mol. Gen. Genet. 238: 270-284.). DNA-DNA hybridization 

1 5 was carried out at 6T C in 6 X SSC, 0.05 X BLOTFO (Sambrook et al., 1 989). 
Filters were washed in 2 X SSC, 0.1% SDS at 62° C for 60 minutes. 

All 16 Cochliobolus species, including the known plant pathogens 
C. carbonum, C victoriae, C miyabeanus, C. sativus and C. spicifer, and five 
genera closely related to Cochliobolus, i.e., Pyrenophora, Setosphaena, Bipolaris, 

20 Stemphyllium and Alternaria showed hybridization intensities comparable to that 
of C heterostrophus itself (Fig. 30 A). 

DNAs of species from nine distantly related genera, including 
several of economic importance (e. g., Magnaporthe grisea, Fusarium 
graminearum, Gaeumannomyces graminis) or of medical importance (e. g., 

25 Candida albicans) hybridized weakly to CPSl (FigS. 30B, 30C) whereas no 
signal was detected in DNA of the basidiomycete Ustilago maydis. 

Three CPSl homolog genes were cloned and characterized. 
Three of them were cloned from phytopathogenic fungi, including the wheat 
head scab fungus Fusarium graminearum {FgCPSly 6003 bp, SEQ. No. 43), 

30 the potato early blight fungus Alternaria solani, (AsCPSI, 2369bp, SEQ. No. 
41) and the barley net blotch fiingus Pyrenophora teres {PtCPSl 2306 bp, 
SEQ. No. 45). FgCPSl was cloned as a full length gene using both PCR 
amplification and the plasmid rescue procedure that was preceded by targeted 
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gene disruption in the genome. AcCPSl and PiCPSl homologs were partially 
cloned by direct PGR amplification. 

The Polymerase Chain Reaction (PGR) was carried out using 
degenerate primers designed to conserved regions of C. heterostrophus CPS J 

5 (ChCPSl). Two sets of degenerate primers were designed to amino acids at or 
close to conserved core sequences of C. heterostrophus CPSl (ChCPSl). The 
first pair of primers: 5TGYTTYATHGCNGGN GTNGTNGGNGTNCC3' 
(CHFP6, corresponding to positions 493-521 of ChCPSl) and 
5'YTGYTGNGGNGGNCCNCCNGGRTT3' (GHRP4, 2197-2220 of 

10 ChCPSl), was used to amplify CPSl from Fusarium graminearum. The 

second pair of primers: 5'-AARAARAARGGNCCNACNGAG-3' (FP4CB, 
conesponding to positions 1531-1550 of ChCPSl) and 5'SRYTGNA 
CGCADATYTCNCC3' (RP2DB, corresponding to positions 3883-3902 of 
ChCPSl), was used to amplify CPS 1 from A, solani and Pyrenophora teres. 

1 5 PGR was carried out in a Perkin Elmer Cetus 9600-thermocycler with fungal 
{Fusarium graminearium, Alternaria solani, and Pyreophora teres) genomic 
DNA as a template. 

Reaction mixtures contained about 500 ng of genomic DNA in 100 ul 
of reaction buffer [ 1 x £x Taq buffer, 0.2 mM dNTPs, 0.2 uM of each primer 

20 and 0.05 U/ml Takara Ex Taq (Pan Vera Corporation)]. An initial denaturing 
step of 95 *=*G for 3 min. was followed by 30 cycles of 94 ""C for 1 min, 47 ^G 
(for G. zeae) or 55 °G (for A, solni and Pyrenophora teres) for 3 min, and 72°G 
for 3 min. Reactions were cooled to 4 **C after a final extension of 72 °G for 10 
min. PGR products were examined (10 ul of each reaction) by agarose-gel 

25 (0.75%) electrophoresis. 

DNA from isolated clones of the three different fungi was 
sequenced at the Cornell DNA Sequencing Facility using TaqCycle automated 
sequencing with DyeDeoxy terminators (Applied Biosystems, Foster City, 
GA). Primers used for sequencing were designed using Primer Select 

30 PNASTAR) and synthesized by the Cornell Oligonucleotide Synthesis 

Facility. Sequencing of each plasmid clone was initiated with vector-specific 
primers or primers designed to previously determined sequences. Sequences 
were analyzed using MapDrawand MegAlign (DNASTAR) and nucleotide or 
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protein database searches were performed with the BLAST program (Altschul 

etal., 1990, 1997). 

The FgCPSJ open reading frame (5125 bp, SEQ. No. 43) has 
50% nucleotide identity to ChCPSl (SEQ IDNo.2) in about 4.4 kb of overlap 
5 (Fig. 2). No "TATA" box-like element was found in the 5' untranslated region, 
but other promoter sequences including two putative "CAAT" boxes and a 
"CT" motif were located upstream of the start codon (ATG) in FgCPSl (Fig. 
32). Only one putative intron was found 1508 bp upstream the stop codon 
(TGA) in contrast to three in ChCPSl (Figs. 31 A, 31B and 32). A putative 
10 polyadenylation signal "AATAA" was located 62 bp downstream of the stop 
codon (Fig. 32). The predicted FgCPSl protein (1692 amino acids, Mj 187983 
Da, SEQ ID No. 44) has 68% identity, 73% similarity to ChCPSl (SEQ ID No. 
3) in a about 1 ,500 amino acid overlap (Figs. 3 1 A and 3 IB) that contains two 
structurally similar modules highly similar to those of ChCPSl . FgCPSl has 
1 5 no significant similarity to ChCPS 1 at the C-terminus, which is relatively 
shorter and lacks the thioeterase domain as seen in ChCPSl (Figs. 31 A and 
3 IB). The annotated FgCPSJ sequence is given in Figure 32. 

AsCPSI (2369 bp, SEQ. No. 41) has 76% nucleotide identity to 
ChCPSl (SEQ ID No.2) in the entire cloned region which contains two conserved 
20 introns (Figs. 3 1 A and B). The translated AsCPS 1 protein (partial) includes 75 8 
amino acids corresponding to amino acids 51 1-1269 in ChCPSl and has up to 
93% identity, 95% similarity to ChCPS 1 (Fig. 2). The annotated AsCPSI 
sequence is given in Figure 33. 

PtCPSl (2306 bp, SEQ. No. 45) has 78% nucleotide identity to 
25 ChCPSl (SEQ IDNo.2) in the entire cloned region which contains only one 

intron (Fig. 2). The translated PtCPSl protein (partial) includes 758 amino acids 
corresponding to amino acids 51 1-1269 in ChCPSl and has 93% identity, 96% 
similarity to ChCPSl (Fig. 2). The annotated PtCPSl seqence is given in Figure 
34. 

30 

FX AMPLE 12 - Targeted disruption ofCPSl homolog in F. graminearum. 

A 2.2 kb Xbal fragment from pUCATPH (Lu, et al, 1 994) 
containing the bacterial hygmycin resistance gene (hygB) driven by the 
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Aspergillus nidulans trpC promoter was inserted into the site of a PCR clone 
pFgC8, which carries the l.O kb internal fragment of FgCPSl, to create pFgC8- 
hygB. This construct was transformed (in a circular form or linearized with 
Hindlll or Bgai) into an isolate (GZ3639) of wild type F. graminearum. Twenty 
5 transformants were obtained by transformation of a wild type isolate (GZ3696) 
with circular plasmid pFgC8-hygB. DNA gel blot analysis of eight such 
transformants confirmed that the CPSl homolog in F. graminearum was disrupted 
by a single cross over recombination (Fig. 35A). Two transformants were 
obtained when the Bgfll-digested pFgC8-hygB was used and only one recovered 
1 0 when the plasmid was digested with Hindlll. Gel blot analysis indicated that all 
three transformants obtained using linearized plasmid integrated into the genome 
at a ectopic location that left the wild type FgCPSI gene intact (Fig. 35A). For 
virulence assays, F. graminearum strains were grown on PDA (or PDA plus 
hygromycin B for transformants) plates (100 X 15 mm) for 7-1 5 days at 24 C 
1 5 under black lights (Sylvania Inc., Danvers, MA) for maximum conidiation. A 
susceptible spring wheat cultivar, Norm Hard Red (kindly provided by G. 
Bergstrom, Cornell university) was used. Two months old wheat plants grown in 
the greenhouse at anthesis (10 plants in one 4" X 6" pot) were sprayed with 10 ml 
conidial suspensions (10^ or 10^ conidia/ml) using a pressurized Preval Spray 
20 Gun Power Unit thin layer chromatography sprayer (Alltech Associates, 
Deerfield. IL) or "injected" into the spikelets (5-10 ul/ per spikelet) using a 
Benchmate pipettor (NICHIRYO, Japan). The inoculated plants were incubated in 
a mist chamber for 48 hours (23° C) and then transferred to a growth chamber (23° 
C, 14 hours of light). Mutant phenotypes were identified by the comparison of the 
25 number of infected or "bleached" spikelets on each head of wheat plants. 

Symptoms were recorded 7-10 days after inoculation. All cpsl' disruptants had 
reduced virulence on wheat plants in the assays while ectopic transformants 
caused disease symptoms indistinguishable from those of wild type (Fig. 35B). 
Although the invention has been described in detail for the 
30 purposes of illustration, it is understood that such detail is solely for that purpose, 
and variations can be made therein by tiiose skilled in the art without departing 
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from the spirit and scope of the invention which is defined by the following 
claims. 
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WHAT IS CLAIMED IS: 

1 . An isolated nucleic acid molecule from a plant pathogen encoding 
a CPSl synthetase, wherein the nucleic acid molecule comprises a nucleotide 
5 sequence which hybridizes under stringent conditions to a nucleic acid molecule 
having a sequence as set forth in SEQ ID No. 41 or a complement thereof; or 
wherein the nucleic acid molecule comprises a nucleotide sequence as set forth in 
SEQ ID No. 41. 

1 0 2. The isolated nucleic acid molecule according to claim 1 wherein 

the plant pathogen is selected from the group consisting of Alternaria solani, 
Alternaha Alternaia alternatherae, A. alternala, A. amaranthi, A. araliae, A. 
brassicae. A. brassicicola. A. camelliae. A. cassiae, A. cheiranthi, A. cinerariae, 
A. gossypii. A. helianthi. A. helianthinficiens. A. mali, and A. raphani. 

15 

3. The isolated nucleic acid molecule according to claim 2 wherein 
the plant pathogen is Alternaria solani. 

4 . An isolated nucleic acid molecule coding for a polypeptide having 
20 the amino acid sequence as,set forth in SEQ ID No. 42. 

5. An isolated nucleic acid molecule comprising the nucleotide 
sequence as set forth in SEQ ID No. 41. 

25 6. An isolated polypeptide encoded by the nucleic acid molecule of 

claim I. 

7. The polypeptide according to claim 6, wherein the polypeptide has 
an amino acid sequence which has at least 75 % similarity to the amino acid 
30 sequence of SEQ. ID. No. 42 as determined by a BLAST program with the default 
parameters. 
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WHAT IS CLAIMED IS: 

1 . An isolated nucleic acid molecule from a plant pathogen encoding 
a CPSl synthetase, wherein the nucleic acid molecule comprises a nucleotide 
5 sequence which hybridizes under stringent conditions to a nucleic acid molecule 
having a sequence as set forth in SEQ ID No. 41 or a complement thereof; or 
wherein the nucleic acid molecule comprises a nucleotide sequence as set forth in 
SEQ ID No. 41. 

1 0 2. The isolated nucleic acid molecule according to claim 1 wherein 

the plant pathogen is selected from the group consisting of Alternaria soldnU 
Alternaria Alternaia alternatherae, A. altemata, A. amaranthi, A, araliae, A. 
brassicae. A, brassicicola, A. camelliae, A. cassiae, A. cheiranthi, A. cineraride, 
A. gossypii, A. helianthi, A, helianthinficiens, A. mali, and A, raphanl 

15 

3. The isolated nucleic acid molecule according to claim 2 wherein 
the plant pathogen is Alternaria solanL 

4. An isolated nucleic acid molecule coding for a polypeptide having 
20 the amino acid sequence as^set forth in SEQ ID No. 42. 

5. An isolated nucleic acid molecule comprising the nucleotide 
sequence as set forth in SEQ ID No. 41. 

25 6. An isolated polypeptide encoded by the nucleic acid molecule of 

claim 1. 

7. The polypeptide according to claim 6, wherein the polypeptide has 
an amino acid sequence which has at least 75 % similarity to the amino acid 
30 sequence of SEQ. ID, No. 42 as determined by a BLAST program with the default 
parameters. 
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8. The polypeptide according to claim 7 wherein the polypeptide 
comprises an amino acid sequence as set forth in SEQ ID No. 42. 

9. An isolated nucleic acid molecule from a plant pathogen encoding 
a CPSl synthetase, wherein the nucleic acid molecule comprises a nucleotide 
sequence which hybridizes under stringent conditions to a nucleic acid molecule 
having a sequence as set forth in SEQ ID No. 43 or a complement thereof; or 
wherein the nucleic acid molecule comprises a nucleotide sequence as set forth in 
SEQ ID No. 43. 



1 0. The isolated nucleic acid molecule according to claim 1 wherein 
the plant pathogen is selected from the group consisting of Fusariumi 
graminearium, Fusarium avenaceum, F. carpineum, F. chlamydosporum, F.. 
coccophilum, F. culmorum, F. episphaeria, F. equiseti, F. flocciferum, F. 

15 moniliforme, F. oxysporum, F. redolens, F, sambucinum, F. solanl F, 
subglutimns, F, (richothecioides, F. udum, or F. ventricosum, 

11. The isolated nucleic acid molecule according to claim 10 wherein 
the plant pathogen is Fusarium graminearium, 

20 

12. An isolated nucleic acid molecule coding for a polypeptide having 
the amino acid sequence as set forth in SEQ ID No. 44. 

1 3. An isolated nucleic acid molecule comprising the nucleotide 
25 sequence as set forth in SEQ ID No. 43. 

14. An isolated polypeptide encoded by the nucleic acid of claim 9. 

15. The polypeptide according to claim 14, wherein the polypeptide 
30 has an amino acid sequence which has at least 75 % similarity to the amino acid 

sequence of SEQ. ID. No. 44 as determined by a BLAST program with the default 
parameters. 
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1 6. The polypeptide of claim 1 5 wherein the polypeptide comprises the 
amino acid sequence as set forth in SEQ ID No. 44. 

1 7. An isolated nucleic acid molecule from a plant pathogen encoding 
5 a CPSl synthetase, wherein the nucleic acid molecule comprises a nucleotide 

sequence which hybridizes under stringent conditions to a nucleic acid molecule 
having a sequence as set forth in SEQ ID No. 45 or a complement thereof; or 
wherein the nucleic acid molecule comprises a nucleotide sequence as set forth in 
SEQ ID No. 45. 

10 

1 8. The isolated nucleic acid molecule according to claim 1 wherein 
the plant pathogen is selected from the group consisting of Pyre nophor a teres, 
Pyrenophora avenae, P. bromi, P. leuceienes, P. phaeocomes, P. schroeteri, P. 
trichostoma, or P. tritici-repentis. 

15 

1 9. The isolated nucleic acid molecule according to claim 1 8 wherein 
the plant pathogen is Pyrenophera teres, 

20. An isolated nucleic acid molecule coding for a polypeptide having 
20 the amino acid sequence as set forth in SEQ ID No. 46. 

21. An isolated nucleic acid molecule comprising the nucleotide 
sequence as set forth in SEQ ID No. 45. 

25 22. An isolated polypeptide encoded by the nucleic acid of claim 1 7. 

23. The polypeptide according to claim 22, wherein the polypeptide 
has an amino acid sequence which has at least 75 % similarity to the amino acid 
sequence of SEQ. ID. No. 46 as determined by a BLAST program with the default 

30 parameters. 

24. The polypeptide of claim 23 wherein the polypeptide comprises the 
amino acid sequence as set forth in SEQ ID No. 46. 
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25. A vector comprising the nucleic acid molecule of any one of claims 
1-5, 9-13, or 17-21. 

26. A vector according to claim 25, wherein the nucleic acid molecule 
5 is operably linked to a promoter. 

27. A vector according to claim 26, wherein the nucleic acid molecule 
is in a sense orientation. 

10 28. A vector according to claim 26, wherein the nucleic acid molecule 

is in an antisense orientation. 

29. The vector according to claim 26, wherein the vector is capable, of 
maintaining and expressing the nucleic acid molecule in bacterial cells. 

15 

30. The vector according to claim 26, wherein the vector is capable of 
maintaining and expressing the nucleic acid molecule in plant cells. 

31. A host cell transformed with the vector according to claim 25. 

20 

32. A host cell according to claim 3 1 , wherein the host is a plant. 

33. A host cell according to claim 3 1 , wherein the host is selected from 
the group consisting of corn, oats, grasses, weeds, bamboo, and sugarcane. 

25 

34. A host cell according to claim 3 1 , wherein the host is com. 

35. A plant transformed with the nucleic acid molecule according of 
any one of claims 1-5, 9-13, or 17-21. 

30 

36. An isolated DNA molecule according to claim 1 , wherein the 
nucleic acid molecule has a nucleotide sequence which is 70% identical to the 
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nucleotide sequence of SEQ. ID. No. 41 as determined by a BLAST program 
with default parameters. 

37. An isolated DNA molecule according to claim 1 , wherein the 

5 nucleic acid molecule is 80% identical to the nucleotide sequence of SEQ. ID. 
No. 41 as determined by a BLAST program with default parameters. 

38. An isolated DNA molecule according to claim 1, wherein the 
nucleic acid molecule is 90% identical to the nucleotide sequence of SEQ. ID. 

10 No. 4 1 as determined by a BLAST program with default parameters. 

' 39. An isolated DNA molecule according to claim 1, wherein the 
nucleic acid molecule has a nucleotide sequence which is 70% identical to the 
nucleotide sequence of SEQ. ID. No. 43 as determined by a BLAST program 
1 5 with default parameters. 

40. An isolated DNA molecule according to claim 1 , wherein the 
nucleic acid molecule is 80% identical to the nucleotide sequence of SEQ. ID. 
No. 43 as determined by a BLAST program with default parameters. 

20 

41. An isolated DNA molecule according to claim 2 1 , wherein the 
nucleic acid molecule is 90% identical to the nucleotide sequence of SEQ. ID. 
No. 43 as determined by a BLAST program with default parameters. 

25 42. An isolated DNA molecule according to claim 1 , wherein the 

nucleic acid molecule has a nucleotide sequence which is 70% identical to the 
nucleotide sequence of SEQ. ID. No. 45 as determined by a BLAST program 
with default parameters. 

30 43 . An isolated DNA molecule according to claim 1 , wherein the 

nucleic acid molecule is 80% identical to the nucleotide sequence of SEQ. ID. 
No. 45 as determined by a BLAST program with default parameters. 
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44. An isolated DNA molecule according to claim 2 1 , wherein the 
nucleic acid molecule is 90% identical to the nucleotide sequence, of SEQ. ID. 
No. 45 as determined by a BLAST program with default parameters. 

5 45. A method for identifying inhibitors of a CPS 1 protein , wherein 

said CPSl protein is a peptide synthetase of a plant pathogen, said method 
comprising: 

providing a CPSl protein or polypeptide; 

contacting the protein or polypeptide with potential inhibitor 

1 0 compounds; 

determining peptide synthetase activity; and 

selecting compounds which decrease the peptide synthetase 

activity. 

1 5 46. The method of Claim 45 wherein the CPS 1 protein is from 

Alternaria solani, 

47. The method of Claim 45 wherein the CPSl protein is from 
Fusarium graminearium, 

20 

48. The method of Claim 45 wherein the CPSl protein is from 
Pyrenophora teres. 

49. A method of imparting disease resistance to a plant, said method 
25 comprising over-expressing a CPS 1 polypeptide in the plant, wherein the 

polypeptide has protein synthetase activity. 

50. A method according to claim 49, wherein the plant is selected from 
a group consisting of com, oats, grasses, weeds, sugarcane, barley, wheat, rice, 

30 tomato, potato, citrus, maius, rye, cotton, brassica, cabbage, and carrot. 

51. The method of claim 49 wherein the CPS 1 polypeptide is from 
Alternaria solani. 
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52. The method of Claim 49 wherein the CPSl polypeptide is from 
Fusarium graminearium, 

53. The method of Claim 45 wherein the CPSl polyeptide is from 
5 Pyrenophora teres. 
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TGCCTGCGCCTGTGCTTGTGCCTGTGGAATGTCGCGGCCCGCTGCTGCAT -776 

AGCCTATCT6TACATACAACACCATCCCATCCCGCTTCACCTGCCTTGCC -726 

TCCCTCCTCGTGCCACACATCCGCCGCCCACAACACCATGGCTGCGACCA -676 

ACCCCGAGCTGCAGGCCAAACTGCAGGAGCTGGACCACGAGCTCGAGGAG -626 

GGCGATATTACACAAAAAGGGTCCGTACTGCTGCACCACCACCGCCATCC -576 

GCCTCTCTGCGTGCGCTAATCAGTCGCATAGCTATGAAAAACGTCGCACC -526 

GTGCTGCTGTCGCAGTATCTAGGGCCTGACTTTGCTGCCCAGTTGCA6GC -476 

CGACCTGAACCAGCAGAACCCACCCCAACCATCCAGTGAGGGCTCTCGCT -426 

CCCGCACCGCATCCTTTGCTATTCCGTCCGGTCCGAGTCCATCACGGCGA -376 

CCACAACCCCCACATATCCAGCTCCCCCGCCCCGACTCATACCATGACGC -326 

TTCCGCACAGGGCCAATTGGGCGCACCCATGCCATATGCGAACGCCTCCG -276 

CCGCTGCCTCGGGGGGCTCGCAGTACATGGCATACCCGCCCAGCCAAGTC -226 

GGCCGTTTTCAAGAGAAGCAGCTGGGCCTGCGTACAAATTCGCTCCAGCG -176 

CAATTCCTCACAGCTGTCGCAAGGAAGCGAGACGTTCATTCCACGGCCTC -126 

AAACGCCTGAATACAACCACTCGCGCGAGCCCACCATGATGGGCAACTAC -76 

gccttVaatccagaVaatcagcaaagttatgatggccaatttggctctcc -26 

GGGAGAGGCCAGTCGAAGGAGCACCAIfiCTCGAGGTAAACCAGGGTTATT 25 

ML EVNQGY 

TTTCCGACTTCACAGGCCAGCAGATGCAAGACAATCGCGACTCGTATGGG 75 
FSDFTGQQMQDNROSYG 

GGACCCAACCGCTACTCGTCGGGAGATGCCTTTTCTCCTACCGCCGCGAT 125 
GPNRYSSGD.AFSPTAA I 

TCCACCTCCCATGATGAACCCCAACGATCTCCCCTTGGGCGCTGCTGAAA 175 
PPPMMNPNOLPLGAAE 

CCATGATGCCGCTAGAGCCCCGCGATCTGCCTTTTGACGTTTACGACCCT 225 
TMMPLEPROLPFDVYDP 
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CACAACCCCAATGTCAAAATGTCAAAGTTTGACAACATTGGCGCTGTCTT 
HNPNVKMSKFDNIGAVL 

GCGTCACCGAAGTCGCACACAGCCAAGGACGACTGCCTTCTGGGTCCTTG 
RHR SRTQPRTTAFWVL 

ACGCAAAAGGCAAAGAGACG6CGTCCATCACCTGGGAAAAGGTGGCTAGT 
DAKGKETASITWEKVAS 

CGCGCGGAAAAGGTGGCCAAAGTGATTCGGGACAAGAGCAACCTCTATCG 
RAEKVAKVIRDKSNLYR 

AGGCGACCGTGTGGCATTAGTGTACAGGGATACAGAAATCATTGATTTTG 
GDRVALYYRDT EI lOF 

TCGTGGCGTTGATGGGC TGCTTCATTGCGG GCGTTGTAGCGGT ACCCATC 

V V A L M G <j^^^^^^i^¥aaVW»^^i 

Core 1 

AATAGCGTCGACGACTACCAGAAACTCATTCTTCTCCTAACGACAACTCA 
DYQKLIL LLTTTQ 



AGCTCATCTCGCATTGACCACAGACAACAATCTCAAGGCCTTTCATCGTG 
AHLALTTDNNLKAFHR 

ACATTAGTCAGAACCGTCTGAAATGGCCGAGTGGGGTAGAGTGGTGGAAG 
OISQNRLKWPSGVEW WK 

ACGAACGAGTTTGGCAGCCACCACCCCAAGAAACATGACGATACTCCAGC 
TNEFGSHHPKKHDDTPA 

TTTGCAAGTACCAGAGGTTGCCTATATTGAGTTCTCGCGTGCACCTACTG 

LQYPEVAYIE i^rai^^a^^S 

Core 2 

GTGACCTTCGCG6TGTGGTGCTTAGTCACCGGACTATTATGCACCAAATG 

M H Q M 



GCCTGCATCAGTGCCATGATTAGCACGATACCCACCAACGCTCAGAGCCA 
ACISAMISTIPTNAQSQ 

AGACACGTTCAGCACTAGCCTACGGGATGCAGAGGGAAAGTTCGTTGCTC 
DTFSTSLROAEG KFVA 

CAGCACCGTCCAGAAACCCCACAGAAGTGATCCTCACGTACCTCGACCCG 
PAPSRNPTEVILTYLDP 

CGCGAAAGCGCTGGTCTCATTCTCAGTGTCTTGTTTGCAGTTTATGGAGG 
RESAGLILSVLF AVYGG 
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CCACACCACCGTATG6CTCGAGACAGCGACCATGGAAACCCCGGGTCTAT 1075 
HTTVWLETATMETPGL 

ATGCACATCTCATCACCAAATACAAGTCCAACATACTGCTAGCGGATTAC 1 125 
YAHLITKYKSNILLADY 

CCAGGCCTCAAGCGCGCTGCATACAACTACCAACAGGATCCAATGGCTAC 1 175 

PGLKRAAYNYQQDPMAT 

AAGAAACTTCAAGAAAAACACAGAACCCAACTTCGCCTCCGTGAAGATCT 1225 
RNFKKNTEPNFASVK 1 

GTCTGATTGACACGCTTACCGTCGACTGTGAATTTCACGAAATTCTCGGA 1275 
CLIDTLTVDCEFHEIL G 

GATCGATATTTCAGGCCACTGCGAAACCCTAGAGCGCGAGAACTGATCGC 1 325 
ORYFRPLR NPRAREL I A 

GCCAATGCTCTGCTTGCCAGAACATGGTGGAATGATAATATCTGTACGCG 1375 
PHLCL PEHGGMII .SVR 

ACTGGCTAGGTGGAGAGGAGCGCATGGGCTGCCCGCTAAGCATAGCAGTA 1 425 
DWLGGEERMGCPLSIAV 

GAAGAGTCAGATAATGATGAAGATGATACAGAGGATAAGTATGCAGCGGC 1 476 
EESDNDEDOTEO KYAAA 

AAATGGCTACTCCAGTCTTATTGGTGGTGGCACTACAAAGAACAAAAAGG 1 525 
NGYSSLIGGGTTKNKK 

AGAAGAAGAAGAAAGGCCCGACAGAGCTTACAGAAATCTTGCTGGACAAG 1 575 
EKKKKGPTELTEILLDK 

GAAGCTCTGAAGATGAACGAAGTCATTGTTCTGGCCATTGGAG AAGAAGC 1 625 
EALKMNEVIVLAIGEEA 

AAGCAAGCGGGCAAACGAGCCCGGCACCATGCGAGTCGGTGCCTTTGGAT 1675 
SKRANEPGTMRVGAFG 

ACCCCATACCGGATGCGACACTAGCTATTGTAGACCCTGAGACAAGTCTT 1 725 
YPIPDAT LAIVDPETSL 

CTATGTTCACCATACTCGATAGGCGAGATCTGGGTAG ATTCG C CTTCACT 1 775 

L c s P Y s i^S^MBTSKi^^H^ 

Cores 

CTCTGGTGGCTTCTGGCAGCTGCAGAAGCATACAGAGACCATTTTCCATG 1 825 
ISMMaffiH WQLQKHTETIFH 
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GCATTGAGCGATCAGTGCAGAACATTGCGCTCGGTGACGATCCCGCTGGC 
GIERSVQNIALGOOPAG 

GGCATGTGGTCATTtGAGGCATCAATGGCACGTCAGCAATTCTTGATGCT 
GM WSFEASMARQQFLML 



1875 
1925 



CTCGACCATACCGTTTCGTTGAGGGTAGCCCTACGCCACAGTTGCTTGAA 
ARPYRFVGGSPTPQLLE 

CTCGAGTTTCTGCGTACTGGACTCCTCGGCTTTGTTGTAGAGGGAAAAAT 

L E F I ^m^M^ ^W av V E G K I 
Core 4 

ATTTGTCCTTGGACTGTACGAAGATCGCATCAGACAGCGTGTTGAAT6GG 1975 
Core 5 

TAGAAAATGGTCAGCTTGAAGCCGAGCATCGATACTTTTTTGTGCAGCAC 2025 

AEHRYFFVQH 



CTGGTCACAAGCATTATGAAGGCCGTGCCAAAAATTTACGACTGgtaagt 2075 
LVTSIMKAVPKIYOC 

gagctgccaacogagcaaggactgtctaacgtgtcatagCTCGTCGTTTG 2125 

S S F 

ATTCTTATGTAAATGGTGAATACCTGCCAATCATTCTCATCGAGACGCAG 2 1 75 
DSYVNGEYLPIILIETQ 

GCCGCATCGACTGCGCCCACAAACCCAGGTGGACCACCACAACAATTGGA 2225 
AAStAPTNPGGPPQQLD 

TATACCATTTTT GGATTC ACTATCTGAGAGGTGCATGGAGGTCCTTTACC 2275 
I P F L^— »S ERCMEVLY 
Core 6 

AAGAGCATCATTTAC6GGTATACTGCGTGATGATTACAGCACCTAATACA 2325 
QEHHLRVYCVMITAPNT 

CTTCCACGAGTCATCAAGAACGGACGGCGAGAAATTGGCAATATGCTGTG 2375 
LPRVIKNGRREIGNMLC 

TAGGAGAGAGTTTGACAATGGCTCTCTGCCCTGTGTNCACGTNAAGTTTG 2425 
RREFONGSLPCVHVKF 

2475 
2525 



CCAA6ACAAGCAATACTCTGGTGTCGATCATCGCGAAGTCGTCATTGACG 2575 
QDKQYSGVD HREVV I D 

. ACAGGACATCGACTCCACTCAATCAGTTCTCGAATATCCACGACCTGATG 2625 
DRTSTPLNQFSNIHDLM 
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CAATGGCGTGTATCTC6GCAGGCCGAGGAACTTGCTTACTGCACTGTCGA 2675 
QWRVSRGAEELAYCTVO 

CGGTCGAGGAAAAGAGGGCAAAGGCGTCAATTGGAAGAAGTTTGATCAAA 2725 
GRGKEGKGVNWKKFDQ 

AGGTTGCGGGCGTA6CAATGTACCTCAAGAACAAGGTCAAGGTCCAGGCC 2775 
KVAGVAMYLKNKVKVQA 

GGCGATCATCTCCTTCTGATGTACACGCATTCAGAAGAATTTGTTTATGC 2826 
GOHLLLMYT HSEEFVYA 

TGTTCATGCATGTTTTGTGCTTGGAGCTGTTTGCATACCAATGGCGCCAA 2875 

Y H A Br<M^aMimMMK«M^>^f«MBM 
Corel 

TTGATC AGAACCGGTTGAATGAGGATGCGCCGGCCTTGCTGCATATCCTT 2925 
QNRLNEOAPALLH I L 

GCAGATTTCAAGGTCAAA6CCATTCTTGTCAACGCTGACGTTGACCATCT 2975 
AOFKV KAILVNAOVDHL 

GATGAAGATCAAGCAAGTATCGCAGCACATCAAACAATCGGCCGCTATCC 3025 
MKIKQVSQHIKQSAAI 

TCAAGATCAGTGTGCCAAACACATACAGCACAACAAAGCCGCCAAAGCAA 3075 
LKISVPNTYSTTKPPKQ 

TCCAGTGGCTGCCGCGACCTCAAGCTTACAATTCGACCGGCATGGATTCA 3 1 25 
SSGCRD LKL T IRPAWIQ 



GGCGGGTTTCCCAGTGCTAGTCTGGA^^A^G^Cl^C^^I^^ 3175 

Core 2 

GTATCGCAGTTCAGCTGG6CCATAGCCAAATCATGGCACTGTGCAAGGTC 3225 

Bm a L C K V 



CAAAAAGAAACATGCCAAATGACAAGTACACGACCAGTCCTTGGTTGTGT 3275 
QKETCQMTSTRPVLGCV 

CCGGAGCACGATAGGACTTGGTTTCCTTCACACTTGTCTCATGGGAATCT 3325 
RSTIGLGFLHTCLMGI 

TCCTTGCCGCACCCACATACCTGGTGTCACCTGTTGACTTTGCACAAAAC 3375 
FLAAPTYLVSPVDFAQN 

CCTAATATTCTGTTCCAAACGCTTTCGCGGTACAAGATCAAGGATGCATA 3^125 
PNILFQTLSRYKIKDAY 
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TGCAACGAGTCAAATGTTGGACCACGCCATCGCACGCGGAGCTGGTAAGA 3175 
ATSQMLDHAIARGAGK 

GTATGGCTCTGCACGA6CTGAAGAATCTCATGATTGCGACTGATGGAAGA 3525 
SMALHELKNLMIATDGR 

CCACGCGTTGATGTTTgtaagtgaacatttgtatgagaggactttcotga 3575 
P R V D V 



ttq ctaoc tcaatgcQgACCAAAGAGTGCGTGTGCACTTTGCGCCAGCCA 
YQRVRVHFAPA 



3625 



ACTTAGACCCAACCGCAATCAACACTGTCTACTCACATGTATTGAACCCA 3675 
NLD PTAINTVYSHVLNP 

ATGGTAGCATCACGATCATACATGTGTATTGAGCCAGTCGAGCTCCATCT 3725 
MVAS RSYMCIEPVELHL 

CGATGTGCATGCTCTGCGACGCGGCCTCGTCATGCCCGTTGACCCTGACA 3775 
DVHALRRGLVMPVDPD 

CAGAGCCCAACGCTTTGCTCGTCCAAGACTCGGGCATGGTGCCAGTGAGC 3825 
TEPNALLVQDSGMVPV S 

ACGCAAATATCCATTGTCAACCCAGAGACCAACCAACTGTGCTTGAACGG 3875 
TQISIVNPE TNQLCLNG 

CGAGTACGGCG AGATCTGGGTGCAGTC CGAGGCGAATGCTTATAGCTTCT 3925 
Cores 

ACATGTCGAAAGAGCGCTTGGATGCAGAACGCTTCAATG6GAGGACGATT 3975 
YMSKERLDAERFNGRT I 

GACGGAGACCCAAATGTGCGATATGTTCGTACAG GCGAT TTAGGATTTTT 1025 

D G 0 P N V R ^H^BBSnBSSL 

Core 4 

GCACAGCGTGACACGGCCCATTGGACCCAACGGTGCACCTGTTGATATGC 4075 
HSVTRPIGPNGAP VDM 

AGGTGCTTTTCGTGCTTGGAAGCATAGGTGACACTTTTGAAGTCAACGGA 1 1 25 
QVLFVL6SIG0TFEVNG 

CTGAACCATTTCTCTATGGACATTGAGCAGTCTGTTGAACGTTGTCACCG 4 1 75 
LNHFSMDIEQSVERCHR 
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GAATATTGTCCCTGGAGGCTGgtacgtttcttcgottcgctgttotttag 4225 
N I V P G G C 

toaatacttactaocactctocagTGCTGTTTTCCAGGCAGGTGGGCTTG 4275 
AVFQAGGL 

TTGTTGTCGTTGTGGAAATCTTCCGACGCAACTTCCTCGCAAGCATGGTG 4325 
VVVVYEIFRRNFLASMV 

CCTGTGATTGTCAATGCAATTTTGAACGAGCATCAGCTGGTCATTGACAT 4375 
PVIVNAILNEHQLVIDI 

TGTCTCGTTTGTGCAAAAGGGCGACTTCCACC6GTCTCGTCTGGGCGAGA 4425 
VSFVQKGOFHRSRLGE 

AGCAACGCGGAAAGATTCTTGCAGGATGGGTCACACGGAAGATGCGCACA 4475 
KQRGKILAGWVTRKMRT 

ATAGCCCAGTACAGTATACGGGATCCTAATGGACAGGATTCCCAGATGAT 4525 

lAQYSIROPN ^SBBHSi n I 

Cora 6 

CACGGAAGAGCCTGGTCCACGGGCTAGCATGACTGGAAGTATGCTTGGGC 4575 
TEEPGPRASMTGSMLG 

GAATGGGCGGCCCAGCCAGTATCAAGGCCGGGTCGACAAGAGCACCGAGT 4625 
RMGGPASIKAGSTRAPS 

CTAATGGGCATGACAGCGACTATGAATAATCTATCCCTTACACAGCAGCA 4675 
LMGMTATMNNLSLTQQQ 

ACAGCAGCAATACCAACAGCCGGGTATGTATGCTCAACAGCAAGGCATGC 4725 
QQQYQQPGMYAQQQGM 

ACCCCCAGCAACAACACCAATTTAGCATGTCCAACACGCCACCACAAGGT 4775 
HPQQQHQFSMSNTPPQG 

CCACCCCAAGGCGTAGAACTACATGATCCTAGCGACCGCACACCAACAGA 4825 
P PQGVELHOPSORTPTD 

CAACCGGCACTCTTTCCTTGCCGACCCGCGTATGCAGAACCA'GGGCCAAA 4876 
NRHSFLAOPRHQNQGQ 

TGAACGAGACGGGCGCCTACGAACCCATGAACTATCAAAACGCGTATCAT 4925 
MNETGAYEPMNYQNAYH 

CCGCATCAACAACAATACGAATCTGAAGACGGGGGGAGCAGACTCAGCGG 4975 
PHQQQYESEDGGSRLSG 
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CCCCGTGCCAGACGTGCTGCGGCCGGGICCIICATXCGGGTCCATAGAGC 6026 
P V P D V L R P I G P S S G I S I E 

AGCACGACCAAGCTAACAACGACAACAATATGTGGAATAATCGCGAGTAC 5075 
QHDQAN NDNNMWNNREY 

TATGGTAACAGCCCATCGTATGCAGGCGGATACACGC AAGATGGCAAT AT 5 1 25 
YGNSPSYAGGYTQDGN I 

CCACGAGCAGCAACAACACGATGAGTACACGAGTAATGCGTCATATGGCG SI 75 
HEQQQHOEYTS NASYG 

GAAATCAAGGAGCAGGCGGAGGCAGCGGCGGCGGTGGCGGTCTCCGAGTT 6225 
GNQGAG66SGGGGGLRV 

GCAAATCGTGACAGCTCCGACAGCGAGGGTGCAGATGACGACGCTTGGAG 5275 
ANRDSSDSEGADDOAWR 

ACGTGATGCCCTTGCTCAGATCAATTTTGCGGGCGGCGCTGGTGCTGCCT 5325 
RDALAQINFAGGA AAA 

CCGCTGGAGCACCTGCTGCTGGTGCTTCTTCTTCGCAGCCGGGCCATGCG 5375 
SAGAPAAGASSSQPGHA 

CA GTAGA CGGGATATGCGTGAGTTTTTTTTTAAATTTCGTACATAGAGAC 5^t25 
Q END 

CGTTGTATACGCAGGTTTCAAATTAGAAGAGCGAATATGCATATCAGCTG 5475 

TTGTTCAATGTTCTAGTTTGGGAAGGTTAACCCCCCCCCCTTCCCCTTCC 5525 

AAGACTTTTCACTTGTTTGTGTGTGATTTAAATCTGGAGATTTCAAATCT 5575 

ACATCTCGCTATACATAGGTGTTGTTTGATAACGTAGGGGGCAGAAGGGT 5625 

ATCTCGTGATATTAGACTGGGAGTTGCATGAATCAAGGTGTTGAGCAAAA 5675 

AAAGAGAGAGCGGTGAAGGGCGGGGGGGATAGGTGGTGTGCACGTGGCTG 5725 
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AAGCCAGGTACGCATCTGCGCAGGGCCCGGCAAGCCCAGTATGACGTTTT -286 

TACCCCAGAGCCAGCCTGCTCCTCGGCTTGGCGCCGCCTGGCTTAGTCAG -236 

CCCCCATCAGTCAGCGGCCAGTCACGT6TTCGGCGC6ACAAGCTCCCACA - 1 86 

TGGCCGAT6ATCATTGTGCCTCG6CTCATGCGCGTCTGCGCTGCTTTGAC - 1 36 
GTGCCCCGGAAATGACGACGCCCAAAGTCTCACACAGCCGCTCGCTGCTG -86 



CTGCTTGCATCTGGTCTTGCTACACT6GCCTCCTTTTCGCA6AACACAGA -36 

ACGATCCACCAACAGG>M>MCr47CAGCCACCACCM£6CCACCCTCATC 15 

M A T L I 

CGCCCCCCGCCCGCCGACCACTCCCAGTCGGCCATTGAGAACGTCCTT6A 65 
RPPPADHSQSAIENVLE 

ACTGACGCAGCTGGCCGACATTGATCCCAACATCTTCACAAACACTCGTC 1 15 
LTQLAOIDPNIFTNTR 

CCCTATGGCATCCCCCCGGTGCGCGCGGCATCTTTGGCGGTGCTGCCATT 1 65 
PLWHPPGARGIFGGAAI 

GCCCA6ACACTCAGCGCCGCCCAGAAGACGGTTGACCCCGACTTCACTGT 2 1 5 
AQTLSAAQKTVDPDFT^i 

GCATTC GATGCACTGCTACTTCATCCTTGCAGGCAACTCCGAGAtCCCCG 265 
^^^^ MHCYFILAGNSEIP 

TCATATACCATGTAGAAAGGGTGCGCTCGGGAAAGTCGTTCGCAACCAGG 3 1 5 
VIYHVERVRSGKSFATR 

ACGGTGCAGGCCCGGCAAAGGGGCAACGTCATCTTCACTACGACCATGAG 365 
TYQARQRGNVrFTTTMS 

TTTTGTTCGCCAGAACAGTGGAGGCGCTCAAAAGGTCGAGCACATCTACC 4 1 5 
FVRQNSG6AQKVEH I Y 

CGATGCCAGACGTCCCGGCGCCCAAGGAAGGTAGCGACGACTTGAAGACG 465 
PMPOVPAPKEGSDDLKT 

CCCAACGATGGCCAGAGTCCCTTCCAGACCCAGCTGTTGCCCATTGAAAA 5 1 5 
PNOGQSPFQTQLLPIEN 

CGgtgagtqacttcgcotaccot ccttaog cctgccctagctcotQtcta 565 
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gCAGACGACTCCGACAAGCCCCACACCAAGAAATGCCGACAATGGATAAA 615 
ADDSDKPHTKKCRQWIK 

GGCTCGCGGTAAAATCTCCCCCGCTGGCGGTCACGAAGCCCATCTCTCCG 665 
ARGK I SPAGGHEAHLS 

CCATCGC AT ACATGTCCGATAGCTACTTCATCGGCACCGTCGCGCGTGCG 7 1 5 
AIAYMSOSYFIGTVARA 

CACAAGCTTCTGCGCTACTCGAACCAGCGCAAGAGCAGGGCCAGGTCGAG 765 
HKLLRYSNQRKSRARSS 

C ATCGACGAGGACGT ACTTAAGAAGCTGCTCGAGATGG ATGATGCCGAGT 8 1 5 
IDEDVLKKLL EMDOAE 

TACAGCGCCAAAGCTTTGTCAAC6AATCAGACAAGCAGCGCATACGTGAA 865 
L Q R Q S F V N E S D K Q R I R E 

TTGA6GAAAGCAGAAGACCTGGCAAAGAGCGGAGAC6CAAAGCCTGAGAT 9 1 5 
LRKAEDLAKSGDAKPE I 

TGGCATGATGGTTAGTCTGGACCACACCATCTACTTTCACAATCCTCGCA 965 
GMMVSLDHTIYFHNPR 

GTTTCCGCGCAGATGAATGGATCTTCACCGAGATGGAGACCCCTTGGGCT 1015 
SFRADEWIFTEHETPWA 

GGTGATGGTCGCGGCCTCGTTTCCCAGAGAATGTATACCAAGGATGGCAC 1 065 
G DGRGLVSQRMY TKDGT 

GCTCATTGCCAGCTGCGTCCAAGAGGTAA6CAGCTT6CTTATGTCTGCT A 1115 
LIASCVQEVSSLLMSA 

TAGTAGACTGTGCTTACATGAATAGGGCGTCATACGATIEAAGCAGAACG 1 165 
IVDCAYMNRASYDEND 

AAAGTAAGCTATAGCCATGACCGTGTGTAGATTGCTCGTATGTCCGTGTA 1215 
CTTTGATTCGATTGACTTTTATGCCGCCGCCTTTCACTTGACCTTGTCGG 1 265 

AGTAGACTCCGCTGTTTCTATTGTTTTAGTAGCTTTAGATTTGGAGGTCT 1315 



GGAACACACGT ACCTCACGTATACACTTTTAAATACAAGGTTCCTCGTT A 1 365 

CAAATAGCAACTCGCACTTGTGTAGTACACGACTTGGCACAGCAAGTGCA 1^15 

TGCAGATGATGCGCACTGCATGCAGCCCTAGAAGCCTGCACCTGTGCTGT 1 ^65 

TCGTCCTTCCCACGTTACAGGCCAGGGTCCAATTAGCGCAATGCCGGTCT 1515 

CGGGCTAGCATGGCCCTCACACTTGGATTCTCCGAAGCTACCGTCGGCAG 1 565 
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I I No protest accompanied the payment of additional search fees. 
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INTERNATIONAL SEARCH REPORT 



tnternttional application No. 
PCT/US0O/asfiS7 



A- CLASSIFICATION OF SUBJECT MATTER 
US CL : 

«5/,.0... *I9, ♦6.-. «.6; S00/.«. ,0a. «*. M5. M6. MT.,. MT.4, «0. «0.>. «0.,. «0., 

BOX n. OBSERVATIONS WHERE UNITY OF INVENTION WAS INCHING 
ThU ISA found muUipl* invention! u followi: 

■ . • , X. « .«^*MS driwn to DNA of SEQ ID NO:*l. vector, trintformed hott cell, trinigenic plint. ud 

°;r:-jr::XT:T^:t^'^r^^^ b. .x«nin.d to ... th., on seq m no.,. 

Group II. ciam(.) 6-8. f"'*" '"^fi^rSTEQ ID NO:45. vector, tr^rformed ho.t cell. tr«„ge„ic pl«.. 

rra^i^ " ««nined to ...nt .e. re.d on SE^ ID Na«. 

Croup IV. .l.in.(.) fT:'°''7"ll^DL'^f''s?^^ Na«. vector. .r«..formed ho., cell, tr^.genic pl«.t. 

r;:; 1Z^:::^oJ:^^^^^ .o t^. e,tent .e, re«l on SE^ ID 

Group VI. cliiin(i) drawn to protein of SEQ ID NO:*6. 

Group VII. claim(a) 45-48. drawn to inhibitor sekction method. 



nM. • y...A « GrouDi I II III IV V, VI and VII do not relate to a single inventive concept under PCT 

Surrbr-lrund" Pct'^rI .'.^^ ..c^ .orre.pondi„g .p.ci. ..cKniC fe.t-r.. for the 

following ««on^: ,^„„.d ;„ .he di.cl.»r.. «d "ene. .h. of 

n I ™«re...-tiJly «.y DNAfrom . pl«it pithogen encoding . cyclic peptide .ynthet«.. NAol.k.y. (Gene 
Te7J.ZZZ)T:^ '^^^^ irolued'froj'p.^t p^hogen. (.ee entire .rticle> Hence, there „ no 

IcilTLhi^cJ feature under PCT Rule which link, the DNA of Group I with the protem of Gro«p II. 
^ the St^To Group III «.d V. «.d the protein, of Group IV «d VI dilTer in con.po.it.on 
.he DNA of Group I the protein of Group II. re.pectively. «.d hence .re not .o linked by . .p.c.J technicd 
?:t„°X °r.'ic" . «d'con.ider.tio.. would be required for .x«nin.tion of .„h of the nucl..c .c.d .e,uence. or 

"""" " AlVoThethibitor ..le.rion method of Group VII i. di.tinct fron. the pl-.t tr^.form.ti.n method of Group I 

•*"'^^T^"e:^^ri::Jnr.'^ IV. v. VI .nd VH do not reUt. to . .ingle inv.nHve concept 



under PCT Rule 13.1. 
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INTERNATIONAL SEARCH REPORT 



International application No. 
PCT/USOO/38««7 



A, CLASSIFICATION OF SUBJECT MATTER: 
US CL : 



«5/3«0.l, 418. 419. 468; «6/«-«. «3.6; 800/«79, 306. 514. 3l5. 316. 317.2. 317.4. 320. 320.1, 320.9. 320.3 

BOX II. OBSERVATIONS WHERE UNITY OF INVENTION WAS LACKING 
This ISA found multiple invention* mb follows: 

Group I. cIdm(.)>-*.»*-**.*W3, dr.wn to DNA ofSEQ ID NO:*., vector .r«.form.d ho., c.ll. tr«»g.nic plu.t. «d 
pl.„t .r^.roro«ion method. a.im. «.d 4«WS will b. exmin.d to the extent the, re«J on SEQ ID N0:*1. 

Group II. cliimfe) 6-8, driwB to protein of SEQ ID NO:48. , ^ „. ■ , . 

CrZ clumf 1 iS-**.*9S». drawn to DNA of SEQ ID NO:+S. vector, tra.formed ho.t cell. tru..Ren.c pluit. 
° d pC;r2mt;lon L.h^d. O^. «-4* «d will be examined to the extent the, read on SEQ ID NO=«. 

Group IV. claim(s) 14-16, drawn to protein of SEQ ID Na44. , ^ ^ „ . • , , 

G^ V. cl«m(.) 17.21.25.44.49..3. drawn to DNA of SEQ ID NO:45 vector, transformed host cell, ^r-g^-^^^^^^^ 
and pLt transformation method, aaim. 26-44 and 49-63 will be exammed to the extent they read on SEQ ID Na46. 
Group VI, claim(s) drawn to protein of SEQ ID NO:46. 

Group VII. claim(s) 46-48. drawn to inhibitor selection method. 



The inventions listed as Groups I. U. III. IV. V. VI and VII do not relate to a single inventive concept under PCT 
Rule 13.1 because, under PCT Rule 13.2. they lack the same or corresponding sptciMl technical features for the 

following reoons.^^ ^CPSr and "stringent conditions" are not well defined in the disclosure, and hence the claims of 
Group I encompass essentially any DNA from a plant pathogen encoding a cyclic peptide synthetase. Nikol.kaya (Gene 
166- 207-211 1996) teaches various CPS genes isolated from plant pathogens (see entire article). Hence, there is no 
special technical feature under PCT Rule 13.2 which links the DNA of Group I with the protem of Group II. 

The DNAs of Group III and V. and the proteins of Group IV and VI differ in composition and structure from 
the DNA of Group 1 and the protein of Group II. respectively. «id hence are not so linked by a special technical 
feature. Separate searches and considerations would be required for ex«nination of each of the nucleic acid sequences or 

amino acid sequences. ^ m c r> t 

Also, the inhibitor selection method of Group VII is distinct from the plant transformation method of Group I 

in starting materials, method steps, and end products. . , . ■ 

Therefore, the invention, of Groups I. II. III. IV. V. VI and VII do not relate to a smgle inventive concept 

under PCT Rule 13.1. 
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